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Abstract 

We extend the 0- approximation of sensing actions and incomplete information in l Son and Ba ral 200H 
to action theories with static causal laws and prove its soundness with respect to the possible world 
semantics. We also show that the conditional planning problem with respect to this approximation is 
NP-complete. We then present an answer set programming based conditional planner, called ASCP, 
that is capable of generating both conformant plans and conditional plans in the presence of sensing 
actions, incomplete information about the initial state, and static causal laws. We prove the cor- 
rectness of our implementation and argue that our planner is sound and complete with respect to the 
proposed approximation. Finally, we present experimental results comparing ASCP to other planners. 

KEYWORDS: Reasoning about Actions and Changes, Sensing Actions, Incomplete Information, 
Conformant Planning, Conditional Planning, Answer Set Programming 



1 Introduction 

Classical planning assumes that agents have complete information about the world. For this 
reason, it is often labeled as unrealistic because agents operating in real- world environment 
often do not have complete information about their environment. Two important questions 
arise when one wants to remove this assumption: how to reason about the knowledge of 
agents and what is a plan in the presence of incomplete information. The first question 
led to the development of several approaches to reasoning about effects of sensing (or 
knowledge producing) actions (IGolden and Weld 1996bl ILoboetal. 19971 IMoore 19851 
|Scherl and Levesque 2003| ISon and Baral 200T1 IThielscher 2000bl l. The second question 
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led to the notions of conditional plan and conformant plan whose execution is guaranteed 
to achieve the goal regardless of the values of unknown fluents in the initial situation. The 
former contains sensing actions and conditionals such as the well-known "if-then-else" or 
"cases" construct, while the latter is just a sequence of actions. In this paper, we refer to 
conditional planning and conformant planning as planning approaches that generate con- 
ditional plans and conformant plans, respectively. We use plan as a generic term for both 
conditional and conformant plan when the distinction between the two is not important. 

Approaches to conditional planning can be characterized by the techniques employed 
in their search process or by the action formalism that supports their reasoning process. 
Most of the early conditional planners implemented a partial-order planning algorithm 
(IGolden 1998IIGolden et al. 1996al|Pryor and Collins 1996||Peotand~S mith 1992) and used 
Situation Calculus or STRIPS as their underlying formalism in representing and reason- 
ing about actions and their effects. Among them, CoPlaS ( ILobo 19 98 1, which is imple- 
mented in Sicstus Prolog, is a regression planner that uses a high-level action description 
language to represent and reason about effects of actions, including sensing actions; and 
FLUX (Thielscher 2000a), a constraint logic programming based planner, is capable of 
generating and verifying conditional plans. Another conditional planner based on a QBF 
theorem prover was developed in ( Rintanen 2000 1. Some other planners, for example, SGP 
(IWeld et al. 1998i or POND QBryce et al. 2004) , extended the planning graph algorithm 
dBlum and Furst 95l to deal with sensing actions. The main difference between SGP and 
POND is that the former searches solutions within the planning graph, whereas the latter 
uses it as a means of computing the heuristic function. 

Conformant planning ( B onet and G effner 2000 Braf man"and Hoffmann 2 004 Ci matti et al. 2004 
ICastellini et al. 20031 Eiter et al. 20031 ISmith and Weld 19981 is another approach to deal 
with incomplete information. In conformant setting, a solution is simply a sequence of ac- 
tions that achieves the goal from every possible initial situation. A recent study ( Ci matti et al. 2 004 1 
shows that conformant planning based on model checking is computationally competitive 
with other approaches to conformant planning such as those based on heuristic search 
algorithms (IBonet and Ge ffner 2000 Brafm an and Hoffmann 20 04 1 or those that extend 
Graphplan ( S mith and Weld 19 98 1. A detailed comparison in (Eite r et al. 200 3 1 demon- 
strates that a logic programming based conformant planner is able to compete with other 
approaches to planning. 

The most important difference between conditional planners and conformant planners 
lies in the fact that conditional planners can deal with sensing actions whereas confor- 
mant planners cannot. Consequently, there are planning problems solvable by conditional 
planners but not by conformant planners. The following example demonstrates this issue. 

Example 1 

Consider a security window with a lock that behaves as follows. The window can be in 
one of the three states opened, closed 1 or locked 2 . When the window is closed or opened, 
pushing it up or down will open or close it respectively. When the window is closed or 
locked, flipping the lock will lock or close it respectively. 

1 The window is closed and unlocked. 

2 The window is closed and locked. 
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Now, consider a security robot that needs to make sure that the window is locked after 
9 pm. Suppose that the robot has been told that the window is not open (but whether it is 
locked or closed is unknown). 

Intuitively, the robot can achieve its goal by performing the following steps. First, (1) it 
checks the window to determine the window's status. If the window is closed, (2. a) it locks 
the window; otherwise (i.e., the window is already locked), simply (2.b) it does nothing. 

Observe that no sequence of actions can achieve the goal from every possible initial 
situation. In other words, there exists no conformant plan achieving the goal. □ 

In this paper, we investigate the application of answer set programming (see e.g. (Bara l 20031 
Lifsc hrtz 20021 |Marek and Tru szczyhs ki 1999| INiemela 1999» in conformant and condi- 
tional planning. To achieve our goal, we first define an approximation semantic for ac- 
tion theories with static causal laws and sensing actions based on the 0-approximation 
in (ISon and Baral 2001). It is an alternative to the possible world semantics for reason- 
ing about effects of actions in the presence of incomplete information and sensing actions 
dMoore 198 5 1. The basic idea of this approach is to approximate the set of possible world 
states by a set of fluent literals that is true in every possible world state. The main advantage 
of the approximation-based approach is its low complexity in reasoning and planning tasks 
(NP-complete) comparing to those based on the possible world semantics X^P-complete 
(IBaral et al. 2000al . The trade-off for this low complexity is incompleteness. As we will 
demonstrate in our experiments, this is not really an issue with the benchmarks in the lit- 
erature. 

We prove that the entailment relationship for action theories based on this approxima- 
tion is sound with respect to the possible world semantics for action theories with incom- 
plete initial situation. We then show that the planning problem with respect to the newly 
developed approximation is NP-complete. This facilitates the development of ASCP, an 
answer set programming based planner that is capable of generating both conditional and 
conformant plans. Given a planning problem instance with incomplete information about 
the initial situation and sensing actions, we translate it into a logic program whose an- 
swer sets ( Gel fond and Lifschitz 198 8 1 - which can be computed using existing answer set 
solvers (e.g. cmodels JLierler and Maratea 2004b . smodels JSimons et al. 20021 . dlv 
HCitrigno et al. 1997) , ASSAT dLin and Zhao 2002b . NoMore ( |Anger, et al. 2002b , etc.) - 
correspond to conformant or conditional plans that satisfy the goal. We compare our plan- 
ner against state-of-the-art planners. The results of our experiments show that conditional 
and conformant planning based on answer set programming can be competitive with other 
approaches. To the best of our knowledge, no answer set based conditional planner has 
been developed except a previous version of the planner presented in an earlier version of 
this paper dSon et al. 20041 . 

The paper is organized as follows. Section |2]presents the basics of an action language 
with sensing actions and static causal laws, including its syntax and the 0-approximation, 
as well as the notions of conditional plans and queries. It also contains the complexity 
result of the conditional planning problem with respect to the 0-approximation. Section[3] 
describes a logic programming encoding of a conditional/conformant planner, called ASCP. 
Section|4]discusses several properties of ASCP. Section|5]experimentally compares ASCP 
with some other state-of-the-art conformant/conditional planners. Section[6]discusses some 
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desirable extensions of the current work. The proofs of theorems and propositions are given 
in Appendices A and B. An example of encoding is given in Appendix C. 

2 A C K — An Action Language with Sensing Actions and Static Causal Laws 

The representation language, A° K , for our planner is an extension of the action language 
Ak in (IS on and Baral 2001 1. While Ak extends the high-level action description language 
A from ( Gelfon dand Lifschitz 19931 by introducing two new types of propositions called 
knowledge producing proposition and the executability condition, A C K extends Ak by 
adding static causal laws and allowing a sensing action to sense more than one fluent. 
Loosely speaking, A C K is a subset of the language Cos in (Ba ral et al. 20 00b). Neverthe- 
less, like Ak, £ds considers sensing actions that sense only one fluent. The semantics 
given for A C K in this paper is an approximation of the semantics of Cds- 



The alphabet of an action theory in A C K consists of a set of actions A and a set of fluents 
F. A fluent literal (or literal for short) is either a fluent / £ F or its negation -if. f and 
->/ are said to be complementary. For a literal I, by -<l, we mean its complement. A fluent 
formula is a propositional formula constructed from the set of literals using operators A, 
V, and/or -i. To describe an action theory, propositions of the following forms are used: 



where a <E A is an action, I is a literal, and ip, (/>, <p, 8 are sets of literals 3 . 

The initial situation is described by a set of propositions Q, called v-propositions. Q 
says that I holds in the initial situation. A proposition of form (j2ji is called executability con- 
dition. It says that a is executable in any situation in which ip holds (the precise meaning of 
hold will be given later). A proposition Q, called a dynamic causal law, represents a con- 
ditional effect of an action. It says that performing a in a situation in which <f) holds causes 
/ to hold in the successor situation. A proposition (j4ji, called a static causal law, states that 
/ holds in any situation in which ip holds. A knowledge proposition (or k-proposition for 
short) states that the values of literals in 8, sometimes referred to as sensed-literals, will 
be known after a is executed. Because the execution of a will determine the truth value of 
at least one fluent, without loss of generality, we assume that 8 contains at least two liter- 
als. Furthermore, we require that if 8 is not a set of two contrary literals / and -if then the 
literals in 8 are mutually exclusive, i.e., 

3 A set of literals is interpreted as the conjunction of its members. The empty set denotes true. 



2.1 Action Language A C K - Syntax 



initially (/) 
executable(a, ip) 
causes(a, I, <p) 

determines(a, 8) 



(1) 
(2) 
(3) 
(4) 
(5) 
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1. for every pair of literals g and g' in 9, g ^ g', the theory contains the static causal 
law 



if(V,M) 



and 



2. for every literal g in 9, the theory contains the static causal law 

ifG?,W I s'e0\ {<?}})• 

For convenience, we use the abbreviation 

oneof(6>) 

to denote the above set of static causal laws. Apart from this, we will sometime write 

determines(a, /) 

to stand for 

determines(a, {/, ->/}). 

Actions appearing in <E} and (0 are called non-sensing actions and sensing actions, 
respectively. In this paper, we assume that they are disjoint from each other. In addition, 
we also assume that each sensing action appears in at most one k-proposition. 

An action theory is given by a pair (V, I) where T> is a set of propositions (|2j-l|5) and X 
is a set of propositions Q. V and T are called the domain description and initial situation, 
respectively. A planning problem instance is a 3-tuple (D,I, Q), where (T>,X) is an action 
theory and Q is a conjunction of fluent literals. It is worth mentioning that with a proper 
set of rules for checking the truth value of a fluent formula (see e.g. (ISon et al. 20 05a i). 
the framework and all results presented in this paper can be extended to allow Q to be an 
arbitrary fluent formula as well. 

Example 2 

The planning problem instance Pi — (Vi,Ii,Gi) in Example n can be represented as 
follows. 

executable(c/iecfc, {}) 
executable(pws/i_itp, {closed}) 
executable(push_down, {open}) 
executable( flip Jock, {^open}) 



causes(push_down, closed, {}) 
causes (pushjup, open, {}) 
causes( flip Jock, locked, {closed}) 
causes( flip Jock, closed, {locked}) 

oneof({open, locked, closed}) 

determines(c/iecA:, {open, closed, locked}) 
Tx — { initially(-iopen) } 

Qi = {locked} 
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□ 

Remark 1 

For an action theory (T>, X), if (I, 0) G V implies that literal I holds in every situation. Since 
I is always true, queries about the truth value of I (or -iZ) have a trivial answer and the theory 
can be simplified by removing all instances of I in other propositions. Furthermore, if the 
theory also contains a dynamic law of the form causes (a, ->Z, </>) then the execution of a in 
a state satisfying </> will result in an inconsistent state of the world. Thus, the introduction 
of I in the action theory is either redundant or erroneous. For this reason, without loss of 
generality, we will assume that action theories in this paper do not contain any static causal 
law © with <p = 0. 

Remark 2 

Since an empty plan can always be used to achieve an empty goal, we will assume hereafter 
that planning problem instances considered in this paper have non-empty goals. 

2.2 Conditional Plan 

In the presence of incomplete information and sensing actions, we need to extend the 
notion of a plan from a sequence of actions so as to allow conditional statements such 
as if-then-else, while-do, or case-endcase (see e.g. ( |Levesque 1996| ILobo et al. 19971 
ISon and Baral 20011 '). Notice that an if-then-else statement can be replaced by a case- 
endcase statement. Besides, if we are only interested in plans with bounded length then 
whatever can be represented by a while-do statement with a non-empty body can also be 
represented by a set of case-endcase statements as well. Therefore, in this paper, we limit 
ourselves to conditional plans with the case-endcase construct only. Formally, we con- 
sider conditional plans defined as follows. We note that our notion of conditional plans 
in this paper is fairly similar to the ones introduced in (Levesq ue 1996| ILobo et al. 19971 
ISon and Baral 200 1> . 

Definition 1 {Conditional Plan) 

1. [] is a conditional plan, denoting the empty plan, i.e., the plan containing no action. 

2. if a is a non-sensing action and p is a conditional plan then [a;p] is a conditional 
plan. 

3. if a is a sensing action with proposition where — {gx, . . . , g n }, and p 3 's are 
conditional plans then [a; cases({gj — > Pj}" =1 )] is a conditional plan. 

4. Nothing else is a conditional plan. 

By this definition, clearly a sequence of actions is also a conditional plan. The execution 
of a conditional plan of the form [a;p], where a is a non-sensing action and p is another 
conditional plan, is done sequentially, i.e., a is executed first, followed by p. To execute 
a conditional plan of the form [a; cases({g., — > Pj}" =1 )], we first execute a and then 
evaluate each gj with respect to our current knowledge. If one of the g 's, say g^, holds, 
we execute the corresponding sub-plan pk- Observe that because fluent literals in 8 are 
mutual exclusive, such g^ uniquely exists. 
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Example 3 

The following are conditional plans of the action theory in Example[2] 

Pi = [pushjlown; flipjock] 

open — > 

P2 = check; cases | closed — > [flipjock] 
locked — > 

open — > [push-down; flipjock] 
p.3 = check; cases I closed — > [flipjock; flipjock; flipjock] 
locked — > ] 

(open — > [] 
closed — > j<2 
locked — > [] 

Among those, p2> £>3 an d P4 are conditional plans that achieve the goal C?i 4 . □ 
In the rest of the paper, the terms "plan" and "conditional plan" will be used alternatively. 

2.3 Queries 

A query posed to an .A^ action theory (T>, T) is of the form 

knows p after p (6) 

or 

whether p after p (7) 

where p is a conditional plan and p is a fluent formula. Intuitively, the first (resp. second) 
query asks whether p is true (resp. known) after the execution of p from the initial situation. 



2.4 O-Approximation Semantics of A C K 

We now define an approximation semantics of A C K , called O-approximation, which extends 
the O-approximation in (ISon and Baral 200T1 to deal with static causal laws. It is defined 
by a transition function $ that maps actions and a-states into sets of a-states (the meaning 
of a-states will follow). Before providing the formal definition of the transition function, 
we introduce some notations and terminology. 

For a set of literals a, ->cr denotes the set {-*l | I G a}, a is said to be consistent if it 
does not contain two complementary literals. A literal I (resp. set of literals 7) holds in a 
set of literals a if I 6 a (resp. 7 C a); I (resp. 7) possibly holds in a if ->l ^ a (resp. 
-,7 n a = 0). 

Given a consistent set of literals a, the truth value of a formula p, denoted by a(p), is 
defined as follows. If p = I for some literal I then a(p) — T if I G a; a(p) = F if -il G cr; 
cr(p) = unknown otherwise. If p = pi A P2 then cr(jo) = T if (r(pi) = T and a{pi) = T; 

4 Note that p2 and p4 can achieve the goal because the first case "the window is open" cannot happen 
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a(p) = F if u(pi) = F or u(p2) = F; a(p) = unknown otherwise. If p = p\ V P2 
then a(p) = T if er(pi) = T or a(p 2 ) = T; a(p) = F if a{pi) = F and a(p 2 ) = F; 
(i(p) = unknown otherwise. If p = — >pi then a(p) = T if <r(pi) = F; a(p) = F if 
cr(pi) = T; a(p) = unknown otherwise. 

We say that p is known to be true (resp. false) in a and write a |= p (resp. a |= -^p) 
if a(p) — T (resp. cr(p) = F). When a |= p or er |= ^p we say that p is known in cr; 
otherwise, p is unknown in cr. We will say that p holds in a if it is known to be true in a. 

A set of literals a satisfies a static causal law @ if either (i) ip does not hold in a; or 
(ii) / holds in a (i.e., ip holds in a implies that / holds in a). By CIt>{<j), we denote the 
smallest set of literals that includes a and satisfies all static causal laws in T>. Note that 
CIt>{&) might be inconsistent but it is unique {see Lemma[0 Appendix A). 

An interpretation I of a domain description V is a complete and consistent set of literals 
in V, i.e., for every fluent / 6 P, (i) / € I or -./ € /; and (ii) {/, ->/} <g /. 

A ifafe s is an interpretation satisfying all static causal laws in V. An action a is ex- 
ecutable in s if there exists an executability condition (0 such that tp holds in s. For a 
non-sensing action a executable in s, let 



The set E(a, s) is often referred to as the direct effects of a. When the agent has complete 
information about the world, the set of possible next states after the execution of a in s, 
denoted by Res^(a, s), is defined as follows. 

Definition 2 (Possible Next States, (McCain and Turner 1995)) 

Let V be a domain description. For any state s and non-sensing action a executable in s, 

Res^la, s) = {s' \ s' is a state such that s' = Cl v (E(a, s) U (s n s'))}. 

The intuitive meaning of this definition is that a literal / holds in a possible next state s' 
of s after a is executed iff either (i) it is a direct effect of a, i.e., I £ -E(a, s) (ii) it holds 
by inertia, i.e., I e (s PI s'), or (iii) it is an indirect effect 5 of a, i.e., Z holds because of the 
operator Cl-p- 

Note that the i?esp-function can be non-deterministic, i.e., Res^(a, s) might contain 
more than one element. The following example illustrates this point. 

Example 4 

Consider the following domain description 



Let s = {—if, ->g, -ih, k}. Clearly s is a state since it satisfies all static laws in T>2- Execut- 
ing a in s results in two possible next states 



E(a, s) = {I | 3 a dynamic causal law @ such that cf> holds in s} 



(8) 



executable(a, {}) 



causes(a, /, {}) 
V 2 = \ tt{g,{f,^h}) 



} 



I »(*,{-/}) 



Resv 2 {a,s) = {{f,^g,h,k},{f,g,^h,k}} 



5 Indirect effects are those caused by static causal laws. 
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In the first possible next state s% = {/, -><?, h, k}, / holds because it is a direct effect of 
a, i.e., / G E(a, s); -u? and k hold because of inertia (s n s\ — {^g, k}); and h holds 
because it is an indirect effect of a (in particular, h holds because of the static causal law 

if(M/, -<?}))■ 

Likewise, we can explain why each literal in the second possible next state holds. □ 
Definition 3 (Consistent Domains) 

A domain description T> is consistent if for every state s and action a executable in s, 

Res^(a,s) ^ 0. 

In the presence of incomplete information, an agent, however, does not always know ex- 
actly which state it is currently in. One possible way to deal with this problem is to repre- 
sent the agent knowledge by a set of possible states (a.k.a. belief state) that are consistent 
with the agent's current knowledge and extend Definition[2]to define a mapping from pairs 
of actions and belief states into belief states as in (Ba ral et al. 2000bV The main problem 
with this approach is its high complexity (B aral et al. 2000al . even for the computation of 
what is true/false after the execution of one action. We address this problem by defining an 
approximation of the set of states in Definition[2]as follows. 

First, we relax the notion of a state in Definition|2]to be an approximate state defined as 
follows. 

Definition 4 (Approximate State) 

A consistent set of literals 5 is called an approximate state (or a-state, for short) if 5 satisfies 
all static causal laws in V. 

Intuitively, 5 represents the (possibly incomplete) current knowledge of the agent, i.e., it 
contains all fluent literals that are known to be true to the agent. When 5 is a subset of some 
state s, we say that it is valid. An action a is executable in 5 if there exists an executability 
condition (0 in T> such that ip holds in 5. 

Next, we define what are the possible next a-states after the execution of an action a in a 
given a-state 5, provided that a is executable in i5. Consider the case that a is a non-sensing 
action. Let 

e(o, 5) — Clx>{{l | 3 a dynamic causal law (JSJ such that <fi holds in S}) (9) 

and 

P<a,S) =lX 2> ci M) (10) 

where 

pc (a, S) = {I | 3 a dynamic causal law |3j s.t. I g" S and </> possibly holds in 5} (11) 
and for i > 0, 

pc i+1 (a,S) = pc i (a,S)U {l\3a static causal law (0 s.t. / £ S,(pnpc i (a,S) ^ 0, 

and ip possibly holds in e (a. 5)} (12) 

Intuitively, e(a, 5) and pc(a, S) denote what definitely holds and what may change in the 
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next situation respectively 6 . Specifically, I € e(a, S) means that I holds in the next situation 
and I £ pc(a, 5) means that I is not in 6 but possibly holds in the next situation. This 
implies that 5 \ ~^pc(a, 5) is an approximation of the set of literals that hold by inertia after 
the execution of a in S. Taking into account the effects of the static causal laws, we have 
that the set of literals 6' = Clj)(e(a, S)U(S\^pc(a, 5))) must hold in the next situation. 
This leads us to the following definition of the possible next a-states after a non-sensing 
action gets executed. 

Definition 5 (O-Result Function) 

For every a-state 5 and non-sensing action a executable in 5, let 

6' = Cl v (e(a, 5)U(6\^pc(a, 5))). 

Define 

1. Resx>(a, 6) — {5'} if 6' is consistent. 

2. Resj}(a, 5) = if 5' is inconsistent. 

The next examples illustrate this definition. 

Example 5 

Consider the domain description V\ in Example|2] Let S = {^open, closed, ^locked}. 
We can easily check that 6 is an a-state of T>\ . We have 

e(f lip Jock, S) — Clx> 1 {{locked}) — {->open,-<closed, locked} 

and 

pc° (flip Jock, 5) — {locked} 
Because if(^open, {locked}) £ T>\, and ii(-iclosed, {locked}) £ T>\, by \\2l . we have 

pc 1 (flip Jock, S) = {locked, ^closed} 

Note that -^open £ pc 1 (flip Jock, S) because it is already in S. 

It is easy to see that pc l ( flip Jock, 5) = pc 1 (flip Jock, S) foralH > 1. Hence, we have 

oo 

pc( flip Jock, 6) = pc 1 (flip Jock, S) = {^closed, locked} 

i=o 

Accordingly, we have 

Resv t (flip Jock, S) = {Clx> 1 (e(f lip Jock, 5) U (S \ ~^pc(f lip Jock, 5)))} = 

{Clx> 1 ({~<open, -^closed, locked})} — {{^open, -^closed, locked}} 

□ 



Note that the operator Cl-p is used in the definition of e(a, 6) to maximize what definitely holds in the next 
situation. 
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Example 6 

For the domain description T>2 in Example|4] we have 

e(a,s)=Cl V2 ({/}) = {/} 

pc°(a,s) = {f} 
As iS(g, {/, -nh}) G X> 2 and if(/i, {/, -.3}) G V 2 , we have 

pc^a^s) = {f,g,h} 

Note that fc g" pc 1 (a, s) since -1/ does not hold in e(a, s). We can check that pc l (a, s) — 
pc 1 (a, s) for all i > 1. Hence, we have 

pc(a,s) = {f,g,h} 

As a result, we have 

fles C2 (a, s) = {CTp 2 (e(a, a) U (s \ ^c(a, s)))} = {C/p 2 ({/, fc})} = {{/, &}} 

□ 

The following proposition shows that when a non-sensing action is executed, the .Res- 
function is deterministic in the sense that it returns at most one possible next a-state; fur- 
thermore, it is "sound" with respect to the i?es c -function. 

Proposition 1 

Let T> be a consistent domain description. For any state ,s, a-state <5 C s, and non-sensing 
action a executable in S, there exists an a-state 6' such that (i) Res-z>(a, 5) = {5'}, and (ii) 
5 1 is a subset of every state s' G Res^,(a, s). 

Proof 

see Appendix A. □ 

We have specified what are the possible next a-states after a non-sensing action is per- 
formed. Let us move to the case when a sensing action is executed. Consider an a-state S 
and a sensing action a with k-proposition (0 in V. Intuitively, after a is executed, the agent 
will know the values of literals in 0. Thus, the set of possible next a-states can be defined 
as follows. 

Definition 6 (O-Result Function) 

For every a-state 8 and sensing action a with proposition l|3 such that a is executable in S , 

Res v {a, 8) = {Cl v {8 U {5}) | g G and Cl v {8 U {g}) is consistent} 

Roughly speaking, executing a will result in several possible next a-states, in each of which 
exactly one sensed-literal in 8 holds. However, some of them might be inconsistent with 
what is currently known. For example, if the security robot in ExampleHknows that the 
window is not open then after it checks the window, it should not consider the case that the 
window is open because this is inconsistent with its current knowledge. Thus, in defining 
the set of possible next a-states resulting from the execution of a sensing action, we need 
to exclude such inconsistent a-states. The following example illustrates this. 
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Example 7 

Consider again the domain description T>\ in Example|2]and an a-state 8\ — {^open}. 
We have 

Cl-Dx (81 U {open}) — {open, ->open, closed, -^closed, locked, ->locked} = <5i i 

CIt> 1 (Si U {closed}) — {^open, closed, -^locked} — 81,2 

CI-d^i U {locked}) — {->open,-iclosed, locked} = #1,3 
Among those, 6%^ is inconsistent. Therefore, we have 

Res Vl {check,5\) = {£1,2, ^1,3} 

□ 

The next proposition shows that if a sensing action is performed in a valid a-state then 
the set of possible next a-states will contain at least one valid a-state. This corresponds to 
the fact that if the current knowledge of the world of the agent is consistent with the state 
of the world, it will remain consistent with the state of the world after the agent acquires 
additional knowledge through the execution of a sensing action. 

Proposition 2 

Let V be a consistent domain description. For any a-state 5, and a sensing action a exe- 
cutable in S, if 5 is valid then Resv(a, S) contains at least one valid a-state. 

Proof 

see Appendix A. □ 

The transition function $ that maps actions and a-states into sets of a-states is defined as 
follows. 

Definition 7 (Transition Function) 

Given a domain description T>, for any action a and a-state 5, 

1 . if a is not executable in 6 then 

$(a,5) =_L 

2. otherwise, 

$(a, 5) = Resv(a, 8) 

The transition function $ returns the set of possible next a-states after performing a 
single action in a given a-state. We now extend it to define the set of possible next a-states 
after the execution of a plan. The extended transition function, called $, is given in the 
following definition. 

Definition 8 (Extended Transition Function) 

Given a domain description T>, for any plan p and a-state <5, 



1 . if p = \\ then 



Hp,S) = {8} 
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2. if p = [a; q], where a is a non-sensing action and q is a sub-plan, then 



_L if$(a,5) = ± 

U 5 'e*(a,5) %> 5 ') otherwise 



3. ifp = [a; cases({gj — > Pj}™ =1 )], where a is a sensing action andp/s are sub-plans, 
then 



_L if$(a,<5) = ± 

x Ui< J <„,j' e $(ai), 5j holds in 5' *(pj> <*') otherwise 
where, by convention, ...U_LU... = _L. 

Items (2) and (3) of the above definition deserve some elaboration. 
Remark 3 

During the execution of a plan p, when a non-sensing action a is encountered (Item 2), by 
Definitions |5] and there are three possibilities: $(a, 6) = _L, $(a, <5) = 0, or $(a, <5) = 
{(5'} for some a-state 5' . If the first case occurs then the result of execution of p in S by 
the definition is also _L. In this case, we say that p is not executable in 8; otherwise, p is 
executable in S. If the second case occurs then by the definition, $(p, S) = 0. One may 
notice that, by Proposition^ this case takes place only if there exists no state s such that 
S C s (i.e., 5 is invalid), or the domain is inconsistent. When $(a, 6) = {6'}, then the 
result of the execution of p in 5 is exactly as the result of the execution of the rest of p in 
5'. 



Remark 4 

If p = [a; cases({<7j — > where a is a sensing action and p/s are sub-plans 

(Item 3), and $(a, (5) ^ _L then by Definitions[6]and0 we know that $(a, S) may contain 
several a-states Sj's. Each <5j corresponds to an a-state in which literal gj holds. Therefore, 
we define 5) to be the union of the sets of possible a-states that are the results of the 
execution of pj in Sj. Note that when we add gj to the current state 5 to generate Sj, we 
assume that gj holds. However, if later on, during the execution of the rest of p, which is 
Pj, we discover that <f>(pj ,Sj) = 0, then our assumption about gj is not correct. Therefore, 
such a 5j contributes nothing to the set of possible a-states of $(a, 5). To see how this can 
happen, consider the following domain description 

executable(a, {}) 
executable(6, {}) 

V 3 = ^ causes(6, h, {}) 

if(/, {<?,/*}) 
H(f,{g,^h}) 
determines(a, /) 

and suppose that the set of fluents is {f,g,h}. Let us see what are the final possible 
a-states after the execution of plan p = [a; cases({/ — > b; -if — > b})] in a-state 8 = {g} 
as defined by the extended transition function. 

When a is performed, we generate two possible next a-states Si = {g 7 /}, and 5 2 = 
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{g, ->f}. Executing b in 62 results in no possible next a-state because Clx> 3 ({<?, ->/, h}) = 
{g, -1/, h, /} is not consistent. This means that $(&, 82), and thus $([&], 82), become 0. 
Therefore, the set of possible final a-states is S) = &([b],6i) — {{/, g, h}}. 

Note that in this example, we did not notice that 82 is inconsistent at the time the action 
a was performed. Rather, its inconsistency was only realized after the execution of b. In 
other words, our assumption that -1/ holds was not correct. 

Similarly to the execution of a non-sensing action, when a sensing action a is performed, 
by Proposition^ $(a, 6) = only if the domain is inconsistent or 8 is invalid. 

The above remarks imply that in some cases, for a plan p and an a-state 6, <t(p, S) may be 
empty. Intuitively, this is because either 8 is invalid or the domain is inconsistent. We will 
show that under reasonable assumptions about 8 and the domain, this cannot happen. 

Definition 9 (Consistent Action Theories) 

An action theory (T>,T) is consistent if T> is consistent and its initial a-state, defined by 
CI V {{1 I initially (/) G I}), is valid. 

The next proposition says that the execution of an executable plan from a valid a-state of a 
consistent action theory will result in at least one valid a-state. 

Proposition 3 

Let (V, X) be a consistent action theory and let 5 be its initial a-state. For every conditional 
planp, if 8) _L then <3?(p, S) contains at least one valid a-state. 

Proof 

see Appendix A . □ 

The above proposition implies that if the action theory (T>, T) is consistent and S is its 
initial a-state then the execution of p in 5 will yield at least a valid trajectory 7 , provided 
that p is executable in 6, This is consistent with the fact that if the initial a-state is complete 
(i.e., if we have complete information) then the execution of an executable plan in the 
initial a-state would return a valid trajectory. From now on, we only consider consistent 
action theories. 

We next define the entailment relationship between action theories and queries. 
Definition 10 (Entailment) 

Let (£>, Z) be an action theory and 5 be its initial a-state. For a plan p and a fluent formula 
p, we say that 

• entails the query knows p after p and write 

V |=i knows p after p 

if $(p, S) 7^ J_ and p is true in every a-state in <t(p, 5); and 

• entails the query whether p after p and write 

V |=i whether p after p 

if $(p, S) 7^ J_ and p is known in every a-state in $(p, S). 

7 A trajectory is an alternate sequence of a-states and actions, 5o a l^l a 2 • • ■ CL n 5 n , such that Si € $(0^, 
for i = 1, . . . , n; A trajectory is valid if Si's are valid a-states. 
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Example 8 

For the action theory (Pi ,Xy) in Example|2] we will show that 

hii knows locked after p 2 (13) 

where p2 is given in Example [3] 

Let ^2,1 = 0' P2.2 — [flip-lock] and p2,3 = Q- It is eas y to see that the initial a-state of 
(Pi,2~i) is <5i = {-iopen}. 

It follows from Example0that 

$(check, 8i) = {5i,2, ^1,3} 

On the other hand, we have 

$(p2,2) ^1,2) = {{locked, -^open, ->closed}} 

and 

^(^2,3,^1,3) = {{locked, ^open, ->closed}} 

Therefore, we have 

®(j>2, Si) = $(p2,2, #1,2) u *(P2,3) = {{/oc/ced, -lopen, ->c/osed}} 

Since locked is true in {locked, ^open, ^closed}, we have (1 1 3I > holds. On the other 
hand, because closed is false in {locked, ^open, -^closed}, we have 

fi ^ii knows closed after p 2 but Pi |=x x knows -^closed after p 2 - 
Likewise, we can prove that 

2>i Hxi knows locked after p 3 and Pi |=x t knows locked after ^4. 
Definition 11 (Solutions) 

A plan p is called a solution to a planning problem instance V = (P, 2", (?) iff 

P |=i knows Q after p 
When p is a solution to V, we say that p is a plan that achieves the goal C?. 

According to this definition, it is easy to see that plans P2, P3, and P4 in Example [3] are 
solutions to Pi = (T>i,Xi, Qi) in Example 13 

2.5 Properties of the O-Approximation 

We will now discuss some properties of the O-approximation. For a domain description 
P, we define the size of P to be the sum of (1) the number of fluents; (2) the number of 
actions; and (3) the number of propositions in P. The size of a planning problem instance 
V = (P,X, G) is defined as the size of P. The size of a plan p, denoted by size(p), is 
defined as follows. 

1. size{[}) = 0; 

2. size([a;p\) = 1 + size(p) if a is a non-sensing action andp is apian; and 

3. size([a; cases({i?j — > Pj}? = i)]) = 1 + S" =1 (l + size(pj)) if a is a sensing action 
and pj's are plans. 

Then, we have the following proposition. 
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Proposition 4 

For a domain description T>, an action a, and an a-state 8, computing $(a, S) can be done 
in polynomial time in the size of T>. 

Proof 

see Appendix A. □ 

From this proposition, we have the following corollary. 
Corollary 2.1 

Determining whether or not a plan p is a solution of the planning problem instance V = 
(T>, I, Q) from an a-state S can be done in polynomial time in the size of p and V. 

Definition 12 

The conditional planning problem is defined as follows. 

• Given: A planning problem instance V = (T>, 2, Q) of size n and a polynomial 

Q(n) > n; 

• Determine: whether there exists a conditional plan, whose size is bounded by Q(n), 
that achieves Q from!" (with respect to Definition II 1> . 

Theorem 1 

The conditional planning problem is NP-complete. 

Proof 

see Appendix A. □ 

The above theorem shows that planning using the O-approximation has lower complexity 
than planning with respect to the full semantics. Here, by the full semantics we mean the 
possible world semantics extended to domains with sensing actions. Yet, the price one has 
to pay is the incompleteness of this approximation, i.e., there are planning instances which 
have solutions with respect to the full semantics but do not have solutions with respect to 
the approximation. This can be seen in the following example. 

Example 9 

Consider the planning problem instance V=(T>, X, Q) with 

V = {causes(a, /. {g}), causes(a, /. {-#})}, I = 0, and Q = {/}. 

We can easily check that p = [a] is a plan achieves / from every initial situation (with re- 
spect to the possible world semantics developed for A C K in (Bar al et al. 2()()()b)). However, 
p is not a solution with respect to Definition!^ because T) knows / after a. 

The above example highlights the main weakness of this approximation in that it does 
not allow for reasoning by cases for non-sensing actions or in the presence of disjunctive 
initial situation. In our experiments with the benchmarks, we observe that most of the 
benchmarks that our planner could not solve fall into the second category, i.e., they require 
the capability of reasoning with disjunctive information about the initial state. Given that 
we do not consider action theories with disjunctive initial state, this should not come as a 
surprise. 
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3 A Logic Programming Based Conditional Planner 

This section describes an answer set programming based conditional planner, called ASCP. 
Given a planning problem instance V = (T>,I, Q), we translate it into a logic program 
Kh,w{'P), where h and w are two input parameters whose meanings will become clear 
shortly, and then use an answer set solver (e.g., smodels or cmodels) to compute its 
answer sets. The answer sets of Hh.w {V) represent solutions to V. Our intuition behind this 
task rests on the observation that each plan p (Definition [0 corresponds to a labeled plan 
tree T p defined as below. 

• If p = [] then T p is a tree with a single node. 

• If p — [a], where a is a non-sensing action, then T p is a tree with a single node and 
this node is labeled with a. 

• If p = [a; q], where a is a non-sensing action and q is a non-empty plan, then T p is a 
tree whose root is labeled with a and has only one subtree which is T q . Furthermore, 
the link between a and T ? 's root is labeled with an empty string. 

• If p = [a; cases({<7j — > Pj}" =1 )], where a is a sensing action that determines 
g/s, then T p is a tree whose root is labeled with a and has n subtrees {T Pj | j 6 
{1, . . . , n}}. For each j, the link from a to the root of T Pj is labeled with gj. 

Observe that each trajectory of the plan p corresponds to a path from the root to a leave 
of T p . As an example, Figure \l\ depicts the labeled trees for plans px, p%, p% and p4 in 
Example|3](black nodes indicate that there exists an action occurring at those nodes, while 
white nodes indicate that there is no action occurring at those nodes). 




Fig. 1. Sample plan trees 



For a plan p, let a be the number of leaves of T p and (3 be the number of nodes along the 
longest path from the root to the leaves of T p . a and (3 will be called the width and height 
of T p respectively. Suppose w and h are two integers that such that a < w and (3 < h. 

Let us denote the leaves of T p by &i, . . . , x a . We map each node y of T p to a pair of 
integers n y = (t y ,p y ), where t y is the number of nodes along the path from the root to y, 
and p y is defined in the following way. 

• For each leaf Xi of T p , p Xi is an arbitrary integer between 1 and w. Furthermore, 
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there exists a leaf x with p-value of 1, i.e., p x = 1, and there exist no i ^ j such that 

Px z = Pxj ■ 

• For each interior node y of T p with children yi, . . . , y r , p y = min.{p Vl , . . . ,p Vr }. 

For instance, Figure |2] shows some possible mappings with h = 4 and w = 5 for the 
trees in Figure [0 It is easy to see that if a < w and (3 < h then such a mapping always 




• O • O < 

(2,1) (2, 1) (2,4] (2,3) (2, 1 




• o 



(2,5) (2,1 




Tpi 



Tp 2 



T P3 



Tp 4 



Fig. 2. Possible mappings for the trees in Figure[T] 



exists. Furthermore, from the construction of T p , independently of how the leaves of T p 
are numbered, we have the following properties. 

1. For every node y,t y <h and p y < w. 

2. For a node y, all of its children have the same i-value. That is, if y has r children 
yi, . . . , y r then t Vi = t y . for every 1 < i,j < r. Furthermore, the p-value of y is the 
smallest one among the p-values of its children. 

3. The root of T p is always mapped to the pair (1,1). 

Our encoding is based on the above mapping. We observe that a conditional plan p can 
be represented on a grid h x w where each node y of T p is placed at the position (t y ,p y ) 
relative to the leftmost top corner of the grid. This way, it is guaranteed that the root of T p 
is always placed at the leftmost top corner. Figure [5] depicts the 4x5 grid representation 
of conditional plans T P3 and T P4 in Figure [2] As it can be seen in Figure [3] each path 
(trajectory) of the plan can end at an arbitrary time point. For example, the leftmost and 
rightmost trajectories of Tp 4 end at 2, whereas the others end at 3. On the other hand, to 
check if the plan is indeed a solution, we need to check the satisfaction of the goal at every 
leaf node of the plan, that is, at the end of each trajectory. In our encoding, this task is 
simplified by extending all the trajectories of the plan so that they have the same height 
h + 1 and then checking the goal at the end of each extended trajectory (see Figure [3}- 
Note that an a-state associated with each node on the extended part of each trajectory in 
our encoding will be guaranteed to be the same as the one associated with the end node of 
the original trajectory. 

We now describe the program TTh,w{'P) in the syntax of smodels (for a concrete 
example, see Appendix C). In ith, w {V), variables of sorts time and path correspond 
to rows and columns of the grid. Instead of using the predicate holds(L,T) (see, e.g., 
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( |Dimopoulos et al. 1997|lLifschitz 1999» to denote that a literal L holds at the time T, we 
use the predicate holds(L, T, P) to represent the fact that L holds at node (T, P) (the time 
moment T, the path number P on the grid). 




Fig. 3. Grid representation of conditional plans 



The program TTh tW (V) contains the following elements. 

1 . Constants. There are two constants used in the program h and w which serve as the 
input parameters of the program. In addition, we have constants to denote fluents, 
literals and actions in the domain. Due to the fact that smodels does not allow 
symbol ->, to represent a literal constant -if, we will use neg(f). 

2. Predicates. The program uses the following predicates. 

• time(T) is true if 1 < T < h. 

• timel(Tx) is true if 1 < T\ < h + 1. 

• path(P) is true if 1 < P < w. 

• fluent(F) is true if F is a fluent. 

• liter al(L) is true if L is a literal. 

• contrary(L, Li) is true if L and L\ are two complementary literals. 

• sense(L) is true if L is a sensed literal. 

• action(A) is true if A is an action 

• holds(L, T, P) is true if literal L holds at (T, P). 

• poss(A, T, P) is true if action A is executable at (T, P). 

• occ(A, T, P) is true if action A occurs at (T, P). That means the node (T, P) 
in T p is labeled with action A. 

• e(L,T,P) is true if literal L is an effect of a non-sensing action occurring at 
(T,P). 

• pc(L, T, P) is true if literal L may change at (T + 1, P). 

• goal(T, P) is true if the goal is satisfied at (T, P). 
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• br(G 7 T, P, Pi) is true if there exists a branch from (T, P) to (T + 1, Pi) 
labeled with G in T p . For example, in the grid representation of T P3 (Figure 
|3Jl, we have br(open, 1, 1, 1), br(closed, 1, 1, 2), and br(locked, 1, 1, 5). 

• used(T, P) is true if (T, P) belongs to some extended trajectory of the plan. 
This allows us to know which paths are used in the construction of the plan and 
thus to be able to check if the plan satisfies the goal. As an example, for T P3 in 
Figure|5J we have used(t, 1) for 1 < t < 5, and used(t, 2) and used(t, 5) for 
2 < t < 5. The goal satisfaction, hence, will be checked at nodes used(5, 1), 
used(5, 2), and used(5, 5). 

3. Variables. The following variables are used in the program. 

• F: a fluent variable. 

• L and L\\ literal variables. 

• T and T±: time variables, in ranges l..h and l..h+ 1 respectively, 

• G, G\ and G 2 : sensed— literal variables. 

• A: an action variable. 

• P, Pi, and P 2 : pa£/i variables, in range 

The domains of these variables are declared in smodels using the keyword tdomain 
(see Appendix C for more details). Observe that the type of each variable has to be 
declared accordingly if this feature of smodels is not used. 

4. Rules. The program has the following facts to define variables of sort time and 
path: 

time{l..h) *— 
timel(l..h+l) <- 
path(l..w) «— 

For each action a, fluent /, or sensed-literal g in the domain, Hh,w{P) contains the 
following facts respectively 

action(a) <— 
fluent(f) «- 
sense(g) <— 

The remaining rules of iTh,w{P) are divided into three groups: (i) domain dependent 
rules; (ii) goal representation and (iii) domain independent rules, which are given 
next. Note that they are shown in a shortened form in which the following shortening 
conventions are used. 

• Two contrary literal variables are written as L and ->L. 

• For a predicate symbol p, and a set 7 of literals or actions, we will write 
p(j, . . .) to denote the set of atoms {p(x, . . .) | x G 7}. 

• For a literal constant I, —I stands for neg(f) (resp. /) if I = / (resp. I = -if) 
for some fluent /. 

For example, the rule i28l stands for the following rule 

holds{L, T+l, P) <— holds{L, T, P), contrary{L, L\), not pc{L\, T, P) 
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3.1 Domain dependent rules 

Rules encoding the initial situation. For each v-proposition Q in X, nh,w(P) con- 
tains the fact 

holds{l, 1,1) <- (14) 

Rules encoding actions' executability conditions. For each executability condition 
(0 in V, TTh,w{'P) contains the rule 

poss(a,T,P) <- holds(ip,T,P) (15) 

Rules for reasoning about the effect of non-sensing actions. For each dynamic 
causal law in D, we add to 7Th,w{'P) the following rules: 

e(l,T,P) <- occ{a 1 T,P),holds{(j) 1 T,P) (16) 
pc(l,T,P) «- occ(a,T,P),notholds(l,T,P),notholds(-*l>,T,P) (17) 

Here, a is a non-sensing action. Its execution changes the world according to the 
Pes-function. The first rule, when used along with (1221 . encodes what definitely 
holds as the effect of a in the next a-state. The second rule, when used along with 
Ql> . describes what would potentially be changed by a (see the definitions of e(a, S) 
and pc(a, S) in Subsection l2.4> . Note that in the second rule, not holds(-i(j), T, P) 
stands for {not holds(-^l) \ I £ </>}, meaning that <fi possibly holds at (T, P). These 
rules will be used in cooperation with iZ3\ . 1271 . and d28i to define the next a-state 
after the execution of a non-sensing action. 

Rules for reasoning about the effect of sensing actions. For each k-proposition 
in T>, TTh, w (P) contains the following rules: 

<- occ(a,T,P),notbr(9,T,P,P)(18) 
l{br{g,T,P,X):newJrr(P,X)}l <- occ(a,T,P) (19) 

<- occ(a,T,P),holds(g,T,P) (20) 

The first rule assures that if a sensing action a occurs at (T, P) then there must be 
a branch from (T, P) to (T + 1, P). The second rule ensures that a new branch, 
corresponding to a new successor a-state, will be created for each literal sensed by 
the action. The last rule is a constraint that prevents a from taking place if one of 
the literals sensed by the action is already known. With this rule, the returned plan 
is guaranteed to be optimal in the sense that a sensing action should not occur if 
one of the literals sensed by the action already holds. Observe that the semantics of 
A°k does not prevent a sensing action to execute when some of its sensed-fluents is 
known. For this reason, some solutions to a planning problem instance might not be 
found using this encoding. However, as we will see later, the program will generate 
an "equivalent" plan to those solutions. Subsection l4.2l will elaborate more on this 
issue. 



22 



Phan Huy Tit, Tran Cao Son, and Chitta Baral 



• Rules for reasoning about static causal laws. For each static causal law @ in V, 
^h,w('P) contains the rules 

pc(l,T,P) <- not holds(l,T,P),pc(l',T,P), 



note(^ip 7 T,P) (21) 
(I' G 

e(Z,T,P) <- e(^,T,P) (22) 

holds(l,T u P) <- holdsfaT^P) (23) 



Rules in this group encode the equations dl0t - <!12t and the operator Clx>- 

3.2 Goal representation 
The following rules encode the goal and make sure that it is always achieved at the end of 



every possible branch created by the execution of the plan. 

goal(T u P) <- holds(g,T u P) (24) 

goal(T 1} P) <- holds(L,T 1 ,P),holds(^L,T 1 ,P) (25) 

<- used{h+l,P),not goal(h+\,P) (26) 



The first rule says that the goal is satisfied at a node if all of its subgoals are satisfied at that 
node. The last rule guarantees that if a path P is used in the construction of a plan then the 
goal must be satisfied at the end of this path, that is, at node (h + 1, P). 

Rule d25t deserves some explanation. Intuitively, the presence of holds(L,T, P) and 
holds(^L, T, P) indicates that the a-state at the node (T, P) is inconsistent. This means 
that no action should be generated at this node as inconsistent a-states will be removed 
by the extended transition function (Definition[8}. To achieve this effect 8 , we say that the 
"goal" has been achieved at (T,P). The inclusion of this rule might raise the question: 
is it possible for the program to generate a plan whose execution yields inconsistent a- 
states only. Fortunately, due to Proposition|5] this will not be the case for consistent action 
theories. 

3.3 Domain independent rules 

• Rules encoding the effect of non-sensing actions. Rules ( fTBl - d!7> specify what 
definitely holds and what could potentially be changed in the next a-state as the effect 
of a non-sensing action. The following rules encode the effect and frame axioms for 
non-sensing actions. 

holds(L,T+l,P) «- e(L,T,P) (27) 
holds(L,T+l,P) 4~ holds(L,T,P),notpc(^L,T,P) (28) 

When used in conjunction with (I16> - d!7i . they define the Res function. 

8 The same effect can be achieved by (i) introducing a new predicate, say stop( T, P ), to represent that the a- 
state at (T, P) is inconsistent; (ii) add ing not stop(T, P) in the body of rule 1351 to prevent action to occur 
at (T, P); and (iii) modifying the rule 1261 accordingly. 
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Inertial rules for sensing actions. This group of rules encodes the fact that the 
execution of a sensing action does not change the world. However, there is one-to- 
one correspondence between the set of sensed literals and the set of possible next 
a-states after the execution of a sensing action. 

«- Pi < P 2 , P 2 < P, br{G u T, Pi,P), 

br(G 2l T,P 2 ,P) (29) 
- Pi<P,G 1 ^G 2 MGi,T,P 1 ,P), 

br{Gi,T,P x ,P) (30) 

«- P 1 <P,br{G,T,P 1 ,P),used(T,P) (31) 

used(T+l,P) <- Pi < P,br(G,T,Pi,P) (32) 

holds(G,T+l,P) <- Px<P,br{G,T,Px,P) (33) 
holds(L,T+l,P) <- Pi < P,br(G,T,Pi,P),holds(L,T,Pi) (34) 

The first three rules make sure that there is no cycle in the plan that we are encoding. 
The next rule is to mark a node as used if there exists a branch in the plan that 
coming to that node. This allows us to know which paths on the grid are used in the 
construction of the plan and thus to be able to check if the plan satisfies the goal (see 
rule J26j). 

The last two rules, along with rule (I23> . encode the possible next a-state correspond- 
ing to the branch denoted by literal G after a sensing action is performed in a state 
S. They say that such a-state should contain G (rule i33\ ) and literals that hold in S 
(rule 03). 

Note that because for each literal G sensed by a sensing action a, we create a cor- 
responding branch (rules (II 81 and ( I19» , the rules of this group guarantee that all 
possible next a-states after a is performed are generated. 
Rules for generating action occurrences. 

l{occ(X,T,P) : action{X)}\ <- used(T, P),not goal(T, P) (35) 

<- occ{A,T,P) 7 notposs(A,T,P) (36) 

The first rule enforces exactly one action to take place at a node that was used but the 
goal has not been achieved. The second one guarantees that only executable actions 
can occur. 
Auxiliary Rules. 

Uteral(F) <- (37) 
literal(->F) «- (38) 
contrary(F, ->F) <— (39) 
contrary(^F, F) <- (40) 
new_6r(P,Pi) <- P < P x (41) 
used(l, 1) <- (42) 
used(T+l,P) <- used(T,P) (43) 

The first four rules define literals and contrary literals. Rule J41i says that a newly 
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created branch should outgo to a path number greater than the current path. The last 
two rules mark nodes that have been used. 



4 Properties of ascp 

This section discusses some important properties of ASCP. We begin with how to extract 
a solution from an answer set returned by ASCP. Then, we argue that ASCP is sound and 
complete with respect to the O-approximation semantics. We also show that ASCP can be 
used as a conformant planner. Finally, we present how to modify ASCP to act as a reasoner. 



4.1 Solution Extraction 

In some previous answer set based planners (Dimo poulos et al. 1997| lEiter et al. 20031 
ILifschitz 1999> . reconstructing a plan from an answer set for a logic program encoding 
the planning problem instance is quite simple: we only need to collect the action occur- 
rences in the model and then order them by the time they occur. In other words, if the 
answer set contains occ(a\, 1), . . ., occ(a m , m) then the plan is ai, . . . , a m . For ~Kh, w ^P\ 
the reconstruction process is not that simple because each answer set for ith,w^P) rep- 
resents a conditional plan which may contain conditionals in the form br(l, t,p,pi). The 
following procedure describes how to extract such a plan from an answer set. 

Let V = {T>,I, Q) be a planning problem instance and S be an answer set for "Kk.w^P)- 
For any pair of integers, 1 < i < h + 1,1 < k < w, we define pf(S) as follows: 

if i = h + 1 or occ(a, i,k) $ S for all a 
a; Pi + i (S) if occ(a, i,k) G S and 

j. a is a non-sensing action 

J a; cases({g 3 ■— ► p i ^_ 1 (5)}" =1 ) if occ(a, i, k) G S, 

a is a sensing action, and 

br(gj, i, k, kj) G S for 1 < j < n 

Intuitively, p^{S) is the conditional plan whose corresponding tree is rooted at node (i, k) 
on the grid h x w. p\(S) is, therefore, a solution to P. This is stated in Theorem|2]in the 
next subsection. 



4.2 Soundness and Completeness 

Theorem 2 

Let (T>,T) be a consistent action theory, V — (T>,T, Q) be a planning problem instance 
and h > 1 and w > 1 be integers. If TT^wi'P) returns an answer set S then p\(S) is a 
solution to V. 

Proof 

see Appendix B. □ 

Theorem[2] shows the soundness of irh,w(V). We will now turn our attention to the com- 
pleteness of Kh,w(J > )- Observe that solutions generated by TT/ ltW {'P) are optimal in the 
following sense 
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1 . actions do not occur once the goal is achieved or a possible next a-state does not 
exist; and 

2. sensing actions do not occur if one of its sensed literals holds. 

The first property holds because of rule J35I and the second property holds because of 
constraint i20\ . Since the definition of a conditional plan in general does not rule out non- 
optimal plans, obviously Hh,w{'P) will not generate all possible solutions to V. 

For example, consider the planning problem instance V\ in Example [2] We have seen 
that plans P2, P3, and p4 in Example [5] are all solutions to V\. However, p^ and p4 are not 
optimal because they do not satisfy the above two properties. 

The above example shows that 717^ (P) is not complete w.r.t. the O-approximation in 
the sense that no one-to-one correspondence between its answer sets and solutions to V 
exists. However, we will show next that it is complete in the sense that for each solution 
p to V, there exist two integers h and w such that irj^ w (V) will generate an answer set 
S whose corresponding plan, p\(S), can be obtained from p by applying the following 
transformation (called the reduct operation). 

Definition 13 (Reduct of a plan) 

Let V = (T>,T, Q) be a planning problem instance, p be a plan and 8 be an a-state such 
that 5) ^ _L. A reduct of p with respect to 5, denoted by redact s(p), is defined as 
follows. 

1 . if p = [] or 5 \= Q then 

reduct 5 (p) = [] 

2. if p — [a; q], where a is a non-sensing action and q is a plan, then 

a; reduct^/ (q) if $(a, S) = {5'} 



reduct,5(j>) = 

3. if p = [a; cases({#j — ► _Pj}" =1 )], where a is a sensing action that senses gi, ... , g n , 
then 



reduct,j(p) = 
where 

9j -- 



redact s(pk) if <?/,• holds in 5 for some k 

a; cases({<7j ■ — > <7j}™ =1 ) otherwise 



if CIt>(6 U {gj}) is inconsistent 
reductc^ (6u{ gj })(Pj) otherwise 

Example 10 

Consider the planning problem instance V\ in Example |2] and plans p%, p^, and p4 in 
Example|3] Let 5 = {^open}. We will show that 

reduct 5 (p 3 ) = p 2 (44) 

and 

reduct 5 (p 4 ) = p 2 (45) 
Because open, closed, and locked do not hold in 5, we have 

reducta(p3) = check; cases({open — > qi, closed — > q 2 , locked — > q%}) 
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where qj's are defined as in Definition! 131 
Let 

<5i = CIt>! (8 U {open}) — {open, -^open, closed, -^closed, locked, -^locked} 
62 = CId ± (8 U {closed}) = {—>open, closed, — 'locked} 
S3 = CIt> ± (8 U {locked}) — {-^open, ^closed, locked} 

It is easy to see that q\ = [] (because 81 is inconsistent) and — [] (because the sub-plan 
corresponding to the branch "locked" in p% is empty). 
Let us compute 52- We have 

q2 = reduct,5 2 (flipJock; flipJock; flip Jock) 

Because 8 2 does not satisfy Q and ^(flipJock, 82) = {^2.1} 7^ 0> where 

82.1 = {-"open, -^closed, locked}, 

we have 

<Z2 = / lip Jock; reduct,5 2 1 (flip Jock; flip Jock) 

As #2,1 satisfies Q, we have reduct,5 2 1 (flipJock; flipJock) = []. Hence, q2 = flipJock. 
Accordingly, we have 

reducta(p3) = check; cases({open — * [], closed — > [flipJock], locked — > []}) = p 2 

That is, holds. 

We now show that ( 145 \ holds. It is easy to see that 

rednctsipi) = check; cases({open — > [], closed — > reduct,5 2 (P2), locked — ► []}) 
Because closed holds in <5 2 > we have 

reduct,5 2 (P2) = TedMct$ 2 (flipJock) = flipJock 

Thus, 

redticta (^4) = check; cases({open — > [], closed — > flipJock, locked — » []}) = ^2 
As a result, we have (145 > holds. 

We have the following proposition. 
Proposition 5 

Let P = (T>,T, Q) be a planning problem instance and 8 be its initial a-state. Then, for 
every solution p to V, reduct,j(p) is unique and also a solution to V. 

Proof 

see Appendix B. □ 

The following theorem shows the completeness of our planner with respect to the 0- 
approximation semantics. 



ASP with Sensing Actions, Incomplete Information, and Static Causal Laws 27 
Theorem 3 

Let V — (T>, X, Q) be a planning problem instance, and p be a solution to V. Then, there ex- 
ist two integers h and w such that 717, -a, (P) has an answer set 5 and = reducta(p), 
where S is the initial a-state of (T>, X). 

Proof 

see Appendix B. □ 

4.3 Special Case: ascp as a Conformant Planner 

Since conformant planning deals only with incomplete information, it is easy to see that 
Kh,i(!P) can be used to generate conformant plans for V. 

Let S be an answer set for 717^1 (P). Recall that we assume that each sensing action 
senses at least two literals. Hence, w = 1 implies S does not contain occ(a, . . .) where 
a is a sensing action because if otherwise rules (II 91 and d30t cannot be satisfied. Thus, 
p\(S) is a sequence of non-sensing actions. By Theorem[2] we know that p\(S) achieves 
the goal of V from every possible initial a-state of the domain, which implies that p\{S) is 
a conformant plan. In Section[5] we compare the performance of %h,i{P) against some of 
the state-of-the-art conformant planners. 

4.4 Special Case: ascp as a Reasoner 

It is easy to see that with minor changes, ASCP can be used to compute the consequences of 
a plan. This can be done as follows. Given an action theory (V, X), for any integers h, w, let 
Tr htW (V,X) be the set of rules: m,.JV)\{^-^. @Q-PQ.$ty-$n\.fi5\.$H\}. 
Intuitively, Wh,w (D, X) is the program obtained from TTh,w (^ > ) by removing the rules for ( i) 
generating the branches when sensing actions are executed; ( ii) checking the satisfaction 
of the goal; (Hi) representing the constraints on branches; and (iv) generating action occur- 
rences. For a plan p, let T p be the corresponding tree for p that is numbered according to 
the principles described in the previous section. We define e(p) to be the following set of 
atoms 

{occ(a, t,p) I 3 a node x in T p labeled with action a and numbered with (t,p)} U 
{br(g, t, p, p r ) I 3a link labeled with g that connects the node numbered with (t, p) 
to the node numbered with (t + l,p') in T p }. 

It is easy to see that the program 717^ (T>, X) U e(p) has a unique answer set which corre- 
sponds to $(p, so)- This is detailed in the following proposition. 

Proposition 6 

Let (V, X) be an action theory, p be a plan, p be a fluent formula, T p be the plan tree for p 
with a given numbering, and h and w be the height and width of T p respectively. Let 

n = 7r h ,„(D,2)Ue(p). 

We have that 



• II has a unique answer set S; 
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• T> \=j knows p after p if and only if 

— there exists some j, 1 < j < w, 5h+i,j(S) ^ _L; and 

— foreveryj, 1 < j < w andSh+ij{S) ^ _L, pis known to be true in 5h+i ,j(S). 

• V |=i whether p after p if and only if 

— there exists some j, 1 < j < w, 5h+i,j(S) ^ _L; and 

— for every j, 1 < j < w and Sh+ij{S) ^ _L, p is known in <5/ l+ i j(S). 

where 

!{Z | holds(l,t,j) G S 1 } if used(t,j) G S 1 and 
{/ | holds(l : t, j) G 5} is consistent 
_L otherwise 

Proof 

The proof of this theorem is very similar to the proof of Theorem[2]so we omit it for brevity. 
□ 

5 Evaluation 

In this section, we evaluate ASCP against other planners using planning benchmarks from 
the literature. We first briefly summarize the features of the systems that are used in our ex- 
periments. We then describe the benchmarks. Finally, we present the experimental results. 

5.7 Planning Systems 
The planning systems that we compared with are the following. 

• DLV K : DLV K is a declarative, logic-based planning system built on top of the DLV sys- 
tem Qhttp : / /www, dbai . tuwien . ac . at/pro j/dlv/| ). The input language JC is 
a logic-based planning language described in ( Eit er et al. 200 3 ). The version we used 
for testing is available at |http : / /www . dbai . tuwien . ac . at/pro j/dlv/K/| . 
DLV K is capable of generating both concurrent and conformant plans. It, however, 
does not support sensing actions and cannot generate conditional plans. 

• CMBP (Conformant Model Based Planner) ( Cimatt iaiid Roveri 1999llCimatti and Roveri 2 000 1: 
CMBP is a conformant planner developed by Cimatti and Roveri. A planning domain 

in CMBP is represented as a finite state automaton. BDD (Binary Decision Diagram) 
techniques are employed to represent and search the automaton. CMBP allows non- 
deterministic domains with uncertainty in both the initial state and action effects. 
Nevertheless, it does not have the capability of generating concurrent and conditional 
plans. The input language to CMBP is ATI described in l |Giunchiglia et al. 1997) . 

The version used for testing was downloaded from [http : / /www, cs .Washington . edu/ re sear ch/ j air /content 

• KACMBP JCimatti et al. 2004t : Similarly to CMBP, KACMBP uses techniques from 
symbolic model checking to search in the belief space. However, in KACMBP, the 
search is guided by a heuristic function which is derived based on knowledge asso- 
ciated with a belief state. 
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KACMBP is designated for sequential and conformant setting. It, however, does 
not support concurrent planning and conditional planning. The input language of 
KACMBP is SMV. The system was downloaded from |http: //sra . itc . it/tools/mbp/AI J0471 

• Conformant-FF (CFF) (Brafman and Hoffmann 2004 1: CFF 9 , to our best knowl- 
edge, is one of the current fastest conformant planners in most of the benchmark do- 
mains in the literature. It extends the classical FF planner ( Hoffm ann and Nebel 200 H 
to deal with uncertainty in the initial state. The basic idea is to represent a belief state 
s just by the initial belief state (which is described as a CNF formula) together with 
the action sequence that leads to s. In addition, the reasoning is done by checking 
the satisfiability of CNF formulae. 

The input language of CFF is a subset of PDDL with a minor change that allows the 
users to specify the initial state as a CNF formula. Both sequential and and confor- 
mant planning are supported in CFF. However, it does not support concurrent and 
conditional planning. 

• MBP dBertoli et al. 200U : MBP is a previous version of CMBR Unlike CMBP which 
only deals with conformant planning, MBP supports conditional planning as well. 

The version used for testing was downloaded from |http : //sra . itc . it /tools /mbp71 

• SGP (Sensory Graph Plan) (IWeld et al. 19981 lAndersone t al. 1998 1: SGP is a plan- 
ner based on the planning graph algorithm proposed by Blum and Furst in ( Blu m and Furst 95V 
SGP supports conditional effects, universal and existential quantification. It also han- 
dles uncertainty and sensing actions. SGP has the capability of generating both con- 
formant and conditional plans, as well as concurrent plans. Nevertheless, static laws 

are not allowed in SGP. The input syntax is PDDL (Planning Domain Definition 
Language). The version used for testing is l.Oh (dated January 14th, 2000), written 
in Lisp, available at |http : / /www, cs . Washington . edu/ai/sgp ■ html| . 

• POND ( Bryce et al. 2004 1: POND extends the planning graph algorithm ( Bl um and Furst 95l 
to deal with sensing actions. Conformant planning is also supported as a feature 

of POND. The input language is a subset of PDDL. POND was downloaded from 
|http : //rakaposhi . eas . asu . edu/belief-search/| 

Table^summarizes the features of these planning systems. 





ASCP 


DLV K 


MBP 


CMBP 


SGP 


POND 


CFF 


KACMBP 


Input Language 


A C K 


K 


An 


An 


PDDL 


PDDL 


PDDL 


SMV 


Sequential planning 


yes 


yes 


yes 


yes 


no 


yes 


yes 


yes 


Concurrent planning 


no 


yes 


no 


no 


yes 


no 


no 


no 


Conformant planning 


yes 


yes 


yes 


yes 


yes 


yes 


yes 


yes 


Conditional planning 


yes 


no 


yes 


no 


yes 


yes 


no 


no 



Table 1 . Features of Planning Systems 



We would like to thank Jorg Hoffmann for providing us with an executable version of the system for testing. 
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5.2 Benchmarks 

To test the performance of the planners, we prepared two test suites for conformant and 
conditional planning, separately. In our preparation, we attempt to encode the planning 
problem instances given to the systems in a uniform way (in terms of the number of actions, 
fluents, and effects of actions). Due to the differences in the representation languages of 
these systems, there are situations in which the encoding of the problems might be different 
for each system. 

5.2.7 Conformant Planning 
We tested the systems on the following domains 10 : 

• Bomb in the Toilet (BT): This set of problems was introduced in ( McDermot t 1987ft : 
"It has been alarmed that there is a bomb in a lavatory. There are m suspicious pack- 
ages, one of which contains the bomb. The bomb can be defused if we dunk the 
package that contains the bomb into a toilet." Experiments were made with m = 2, 
4, 6, 8, and 10. 

• Bomb in the Toilet with Multiple Toilets (BMT): This set of problems is simi- 
lar to the BT problem but we have multiple toilets. There are five problems in this 
set, namely BMT(2, 2), BMT{4, 2), BMT(6, 2), BMT(8, 4), and BMT {10, 4), 
where the first parameter is the number of suspicious packages and the second pa- 
rameter is the number of toilets. 

• Bomb in the Toilet with Clogging (BTC): This set of problems is similar to BTs 
but we assume that dunking a package clogs the toilet and flushing the toilet unclogs 
it. We know that in the beginning, the toilet is unclogged. We did experiments with 
m = 2,4, 6, 8, and 10, where rn is the number of suspicious packages. 

• Bomb in the Toilet with Multiple Toilets and Clogging (BMTC): This set of prob- 
lems is similar to BTC but we have multiple toilets. We did experiments with five 
problems BMTC{2, 2), BMTC{4, 2), BMTC{6, 2), BMTC{8, 4), and 

BM TC(10, 4), where the first parameter is the number of suspicious packages and 
the second parameter is the number of toilets. 

• Bomb in the Toilet with Clogging and Uncertainty in Clogging (BTUC): This 
set of problems is similar to BTC except that we do not know whether the toilet is 
clogged or not in the beginning. 

• Bomb in the Toilet with Multiple Toilets and Uncertainty in Clogging (BM- 
TUC): This set of problems is similar to BMTC except that we do not know whether 
or not each toilet is clogged in the beginning. 

• Ring: This set of problems is from (Ci matti et al. 20 04). In this domain, one can 
move in a cyclic fashion (either forward or backward) around a rt-room building to 
lock windows. Each room has a window and the window can be locked only if it is 
closed. Initially, the robot is in the first room and it does not know the state (open, 
closed or locked) of the windows. The goal is to have all windows locked. A possible 



The system is available at htt p : / / www . cs ■ nmsu . edu/ ~tson/ASPlan/Sensinq| 
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conformant plan is to perform a sequence of actions forward, close, lock repeatedly. 
In this domain, we tested with n =2,4,6,8, and 10. 

• Domino (DOM): This domain is very simple. We have n dominos standing on a 
line in such a way that if one of them falls then the domino on its right also falls. 
There is a ball hanging close to the leftmost one. Touching the ball causes the first 
domino to fall. Initially, the states of dominos are unknown. The goal is to have the 
rightmost one to fall. The solution is obviously to touch the ball. In this domain, we 
tested with n =10,20,50,100, 1000, and 10000. 

5.2.2 Conditional Planning 
The set of problems for testing includes: 

• Bomb in the Toilet with Sensing Actions (BTS): This set of examples is taken 
from (Wel d et al. 199 8 ). They are variations of the BTC problem that allow sensing 
actions to be used to determine the existence of a bomb in a specific package. There 
are m packages and only one toilet. We can use one of the following methods to 
detect a bomb in a package: (1) use a metal detector (action detect -metal); (2) use a 
trained dog to sniff the bomb (action sniff); (3) use an x-ray machine (action xray); 
and, finally, (4) listen for the ticking of the bomb (action listen-f or -ticking). 
This set of examples contains four subsets of problems, namely BTSl(m), 
BTS2(m), BTS3(m), and BTS4(m) respectively, where m is the number of sus- 
picious packages. These subsets differ from each other in which ones of the above 
methods are allowed to use. The first subset allows only one sensing action (1); the 
second one allows sensing actions (l)-(2); and so on. 

• Medical Problem (MED): This set of problems is from (IWeld etal. 1998i . A patient 
is sick and we want to find the right medication for her. Using a wrong medication 
may be fatal. Performing a throat culture will return either red, blue, or white, which 
determines the group of illness the patient is infected with. Inspecting the color (that 
can be performed only after the throat culture is done) allows us to observe the color 
returned by a throat culture, depending on the illness of the patient. Analyzing a 
blood sample tells us whether or not the patient has a high white cell count. This 
can be done only after a blood sample is taken. In addition, we know that in the 
beginning, the patient is not dead but infected. In addition, none of the tests have 
been done. 

There are five problems in this set, namely, MED1, . . ., A1ED5. These problems 
are different from each other in how much we know about the illness of the patient 
in the beginning. 

• Sick Domain (SICK): This set of problems is similar to MED. A patient is sick and 
we need to find a proper medication for her. There are n kinds of illness that she 
may be infected with and each requiring a particular medication. Performing throat 
culture can return a particular color. Inspecting that color determine what kind of 
illness the patient has. Initially, we do not know the exact illness that the patient is 
infected with. 

The characteristic of this domain is that the length of the plan is fixed (only 3) but 
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the width of the plan may be large, depending on the number of illnesses. We did 
experiments with five problems in the domain, namely, SICK(2), SICK(4), 
SICK(10). They differ from each other in the number of illnesses that the patient 
may have. 

• Ring (RINGS): This domain is a modification of the RING domain. In this mod- 
ified version, the agent can close a window only if it is open. It can lock a window 
only if it is closed. The agent can determine the status of a window by observing it 
(sensing action observe-window). 

• Domino (DOMS): This is a variant of the DOM domain in which some domi- 
nos may be glued to the table. Unlike the original version of the DOM domain, 
in this variant, when a domino falls, the next one falls only if it is not glued. The 
agent can do an action to unglue a glued domino. We introduce a new sensing action 
ob serve -domino (X) to determine whether a domino X is glued or not. 

5.3 Performance 

We ran our experiments on a 2.4 GHz CPU, 768MB RAM, DELL machine, running 
Slackware 10.0 operating system. We compared ASCP with DLV K , CMBP, SGP, CFF and 
KACMBP on the conformant benchmarks and with SGP, POND, and MBP on the condi- 
tional benchmarks. Time limit was set to 30 minutes. The CMU Common Lisp version 19a 
was used to run SGP examples. We ran ASCP examples on both cmodels and smodels. 
By convention, in what follows, we will use ASCP C and ASCP S to refer to the planner ASCP 
when it was run on cmodels and smodels respectively. Sometimes, if the distinction 
between the two is not important, by ASCP we mean both. 

The experimental results for conformant and conditional planning are shown in Tables[2] 
and|3]respectively. Times are in seconds. "TO/AB" indicates that the corresponding planner 
does not return a solution within the time limit or stopped abnormally due to some reasons, 
for example, out of memory or segmentation fault. 

In conformant setting (Table 0, it is noticeable that ASCP C behaves better than ASCP S 
in all the conformant benchmark domains, especially in large problems. Furthermore, CFF 
and KACMBP are superior to all the other planners on most of the testing problems. Espe- 
cially, both of them scale up to larger instances very well, compared with the others. Yet, it 
is interesting to observe that ASCP C does not lose out a whole lot against these two planners 
in many problems. In the following, we will discuss the performance ASCP in comparison 
with CMBP, DLV K , and SGP. 

It can be seen that ASCP C is competitive with CMBP and outperforms DLV K and SGP 
in most of problems. Specifically, in the BT domain, ASCP took only 0.12 seconds to 
solve the last problem, while DLV K , CMBP, and SGP took 11.37, 0.5 and 2.13 seconds 
respectively. ASCP S however is slower than CMBP and SGP in this domain. 

In the BMT domain, ASCP S is the worst. ASCP S took more than two minutes to solve 
the largest problem in this domain, while CMBP took only 0.53 seconds. ASCP C , however, 
is competitive with CMBP and outperforms both DLV K and SGP. 

In the ETC domain, although ASCP S is better than DLV K and SGP, its performance is far 
from that of CMBP. The time for ASCP 5 to solve the largest problem is nearly 8 minutes, 
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Problem 


Min. 


ASCP 


DLV K 


CMBP 


SGP 


CFF 


KA- 




PL 


cmodels 


smodels 










CMBP 


BT(2) 


2 


0.06 


0.03 


0.01 


0.03 


0.04 


0.02 


0.12 


BT(4) 


4 


0.04 


0.06 


0.03 


0.03 


0.27 


0.04 


0.12 


BT(6) 


6 


0.05 


0.12 


0.18 


0.04 


0.42 


0.09 


0.1 


BT(8) 


8 


0.10 


0.33 


1.47 


0.10 


1.04 


0.10 


0.11 


BT(10) 


10 


0.12 


2.54 


11.37 


0.50 


2.13 


0.13 


0.11 


BMT(2,2) 


2 


0.04 


0.04 


0.01 


0.03 


0.07 


0.02 


0.07 


BMT(4,2) 


4 


0.05 


0.09 


0.03 


0.04 


0.28 


0.03 


0.12 


BMT(6,2) 


6 


0.11 


0.23 


0.19 


0.05 


0.29 


0.07 


0.10 


BMT(8,4) 


8 


0.41 


4.70 


1.70 


0.11 


3.14 


0.09 


0.11 


BMT(10,4) 


10 


0.51 


152.45 


12.18 


0.53 


5.90 


0.12 


0.14 


BTC(2) 


2 


0.04 


0.04 


0.01 


0.03 


0.44 


0.05 


0.12 


BTC(4) 


7 


0.04 


0.12 


0.33 


0.04 


21.62 


0.06 


0.10 


BTQ6) 


11 


0.06 


0.33 


TO 


0.1 


TO 


0.07 


0.11 


BTC(8) 


15 


0.11 


0.53 


TO 


0.79 


TO 


0.07 


0.13 


BTC(IO) 


19 


0.12 


468.04 


TO 


9.76 


TO 


0.13 


0.14 


BMTC(2,2) 


2 


0.06 


0.06 


0.01 


0.03 


0.18 


0.05 


0.12 


BMTQ4.2) 


6 


0.10 


0.19 


0.17 


0.05 


2.03 


0.04 


0.09 


BMTC(6,2) 


10 


0.14 


0.63 


20.02 


0.24 


TO 


0.07 


0.12 


BMTC(8,4) 


12 


0.56 


60.56 


TO 


TO 


TO 


0.10 


0.12 


BMTC(10,4) 


16 


1.44 


TO 


TO 


TO 


TO 


0.13 


0.17 


BTUQ2) 


4 


0.05 


0.04 


0.02 


0.02 


0.59 


0.03 


0.09 


BTUQ4) 


8 


0.04 


0.11 


0.94 


0.04 


TO 


0.04 


0.11 


BTUQ6) 


12 


0.06 


0.22 


524.3 


0.11 


TO 


0.06 


0.11 


BTUQ8) 


16 


0.11 


4.7 


TO 


0.96 


TO 


0.08 


0.12 


BTUC(IO) 


20 


0.12 


TO 


TO 


11.58 


TO 


0.13 


0.16 


BMTUQ2.2) 


4 


0.06 


0.07 


0.03 


0.03 


16.11 


0.06 


0.11 


BMTUQ4.2) 


8 


0.10 


0.23 


0.24 


0.07 


TO 


0.09 


0.14 


BMTUC(6,2) 


12 


0.14 


19.88 


1368.28 


0.43 


TO 


0.08 


0.14 


BMTUC(8,4) 


16 


0.56 


TO 


TO 


TO 


TO 


0.13 


0.18 


BMTUC(10,4) 


20 


0.63 


TO 


TO 


TO 


TO 


0.16 


0.16 


RING(2) 


5 


0.12 


0.47 


0.201 


0.04 


0.14 


0.05 


0.00 


RING(4) 


11 


0.21 


6.76 


0.638 


0.05 


2.28 


0.09 


0.12 


RING(6) 


17 


31.73 


TO 


TO 


0.40 


77.10 


0.20 


0.13 


RING(8) 


23 


1246.58 


TO 


TO 


832.73 


TO 


0.74 


0.18 


RING(IO) 


29 


TO 


TO 


TO 


TO 


TO 


2.46 


0.18 


DOM(IO) 


1 


0.11 


0.08 


0.03 


0.04 


2.24 


0.05 


0.13 


DOM(20) 


1 


0.14 


0.07 


0.24 


0.05 


33.4 


0.29 


0.14 


DOM(50) 


1 


0.47 


0.40 


1368.28 


0.06 


1315.98 


4.44 


1.34 


DOM(IOO) 


1 


1.70 


1.64 


TO 


0.11 


TO 


TO 


2.56 


DOM(500) 


1 


31.28 


32.52 


TO 


2.16 


TO 


TO 


29.10 


DOM(IOOO) 


1 


121.91 


129.96 


TO 


9.83 


TO 


TO 


TO 



Table 2. Conformant Planning Performance 



while that for CMBP is just 9.76 seconds. Again, ASCP C is the best. It took only 0.12 
seconds to solve the same problem. 

The BMTC domain turns out to be hard for DLV K , CMBP, and SGP. None of them were 
able to solve the BMTC(8, 4) within the time limit. Although ASCP S was able to solve 
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Problem 


Min. Plan 


ASCP 


SGP 


POND 


MBP 




Length & Width 


cmodels 


smodels 








BTS1(2) 


2x2 


0.166 


0.088 


0.11 


0.188 


0.047 


BTS1(4) 


4x4 


0.808 


1.697 


0.22 


0.189 


0.048 


BTS1(6) 


6x6 


5.959 


83.245 


2.44 


0.233 


0.055 


BTS1(8) 


8x8 


25.284 


TO 


24.24 


0.346 


0.076 


BTS1(10) 


10x10 


85.476 


TO 


TO 


0.918 


0.384 


BTS2(2) 


2x2 


0.39 


0.102 


0.19 


0.186 


0.038 


BTS2(4) 


4x4 


1.143 


3.858 


0.32 


0.198 


0.067 


BTS2(6) 


6x6 


19.478 


1515.288 


3.23 


0.253 


2.163 


BTS2(8) 


8x8 


245.902 


TO 


25.5 


0.452 


109.867 


BTS2(10) 


10x10 


345.498 


TO 


TO 


1.627 


178.823 


BTS3(2) 


2x2 


0.357 


0.13 


0.22 


0.185 


0.082 


BTS3(4) 


4x4 


1.099 


5.329 


0.44 


0.195 


1.93 


BTS3(6) 


6x6 


7.055 


TO 


3.89 


0.258 


147.76 


BTS3(8) 


8x8 


56.246 


TO 


28.41 


0.549 


AB 


BTS3(10) 


10x10 


248.171 


TO 


TO 


2.675 


AB 


BTS4(2) 


2x2 


0.236 


0.149 


0.26 


0.194 


0.098 


BTS4(4) 


4x4 


1.696 


3.556 


0.64 


0.191 


AB 


BTS4(6) 


6x6 


13.966 


149.723 


4.92 


0.264 


AB 


BTS4(8) 


8x8 


115.28 


TO 


30.34 


0.708 


AB 


BTS4(10) 


10x10 


126.439 


TO 


TO 


4.051 


AB 


MED(1 ) 


lxl 


1.444 


1.434 


0.09 


0.187 


0.048 


MED(2) 


5x5 


35.989 


9.981 


0.59 


0.193 


0.047 


MED(3) 


5x5 


42.791 


9.752 


1.39 


0.2 


0.049 


MED(4) 


5x5 


39.501 


10.118 


7.18 


0.205 


0.049 


MED(5) 


5x5 


35.963 


9.909 


44.64 


AB 


0.05 


SICK(2) 


3x2 


0.234 


0.121 


0.21 


0.189 


0.045 


SICK(4) 


3x4 


0.901 


0.797 


10.29 


0.19 


0.048 


SICK(6) 


3x6 


5.394 


3.9 


TO 


0.201 


0.059 


SICK(8) 


3x8 


17.18 


14.025 


TO 


0.221 


0.129 


SICK(IO) 


3x10 


82.179 


43.709 


TO 


0.261 


0.778 


RINGS(l) 


3x3 


0.768 


0.14 


0.67 


0.198 


0.045 


RINGS(2) 


7x9 


1386.299 


TO 


TO 


0.206 


0.057 


RINGS(3) 


11x27 


TO 


TO 


TO 


0.391 


0.207 


R1NGS(4) 


15x64 


TO 


TO 


TO 


3.054 


3.168 


DOMS(l) 


3x1 


0.117 


0.203 


0.11 


0.08 


0.043 


DOMS(2) 


5x4 


0.306 


0.325 


48.82 


0.183 


0.048 


DOMS(3) 


7x8 


3.646 


53.91 


TO 


0.19 


0.057 


DOMS(4) 


9x16 


87.639 


TO 


TO 


0.248 


0.101 


DOMS(5) 


11x32 


TO 


TO 


TO 


0.687 


0.486 



Table 3. Conditional Planning Performance 

this instance, it could not solve the last instance. ASCP C on the contrary can solve these 
instances very quickly, less than two seconds for each problem. 

In the BTUC and BMTUC domains, although not competitive with ASCP C , CMBP 
outperforms both DLV K and SGP. For example, CMBP took less than 12 seconds to solve the 
largest instance in the BTUC domain, while ASCP S , DLV K , and SGP indicated a timeout. 
ASCP S is competitive with DLV K and much better than SGP. Its performance is worse than 
CMBP in these domains however. 
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The RING domain is really hard for the planners except CFF and KACMBP. CFF 
and KACMBP took just a few minutes to solve the largest problem; however, KACMBP 
seems to scale up better than CFF on this domain. None of the other planners could solve 
the last problem. Among the others, CMBP is the best, followed by ASCP C . CMBP took 
around 14 minutes to solve RING (8) while ASCP C took more than 20 minutes. ASCP 5 is 
outperformed by both DLV K and SGP 

In the last domain, DOM, again, CMBP outperforms ASCP, DLV K , and SGP. The solving 
time of ASCP for the last problem is around 2 minutes, while that for CMBP is just less 
than 10 seconds. DLV K and SGP were able to solve the first three instances of this domain 
only. It is worth noting here that the not-very-good performance of CFF and KACMBP on 
this domain is because that this domain is in nature very rich in static causal laws, a feature 
that is not supported by CFF and KACMBP. Therefore, to encode the domain in CFF and 
KACMBP, we had to compile away static causal laws. 

The performance of ASCP in the conditional benchmarks is not as good as in the confor- 
mant benchmarks, compared with other testing planners. As can be seen in Table it was 
outperformed by both POND and MBP in the benchmarks, except in the last two problems 
of the BTS3 domain or in the last three of the BTS4, where M BP had a problem with 
segmentation fault or memory excess, or in MED(5) problem where POND stopped ab- 
normally. Both POND and MBP did very good at testing domains. POND took just a few 
seconds to solve each instance in the testing domains. ASCP is also not competitive with 
SGP in small instances of the first five domains (BTSl-MED). However, when scaling up 
to larger problems, ASCP C seems to be better than SGP. In the last three domains (SICK, 
RINGS, and DOMS), SGP is outperformed by both ascp c and ascp s . 

6 Conclusion and Future Work 

In this paper, we define an approximation for action theories with static causal laws and 
sensing actions. We prove that the newly developed approximation is sound with respect to 
the possible world semantics and is deterministic when non-sensing actions are executed. 
We also show that the approximation reduces the complexity of the conditional planning 
problem. 

We use the approximation to develop an answer set programming based conditional 
planner, called ASCP. ASCP differs from previously developed model-based planners for 
domains with incomplete initial state (e.g. ( Bonet and Geffne~0 00 Cima tti and Roveri 1999 
lEiter et al. 20031 ISmith and Weld 1 998 1) in that it is capable of dealing with sensing ac- 
tions and generating both conditional and conformant plans. We prove the correctness of 
ASCP by showing that plans generated by ASCP are solutions of the encoded planning 
problem instances. Furthermore, we prove that ASCP will generate a solution to V if it has 
a solution with respect to the given approximation. We also discuss the use of ASCP in 
reasoning about effects of conditional plans. 

We compare ASCP with several planners. These results provide evidence for the useful- 
ness of answer set planning in dealing with sensing actions and incomplete information. 
Our experiments also show that there are situations in which ASCP does not work as well as 
other state-of-the-art planners. In the future, we would like to investigate methods such as 
the use of domain knowledge to speed up the planning process (see e.g. (ISon et al. 20 05a i). 
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Appendix A - Proofs related to the O-Approximation 

This appendix contains the proofs for the propositions and theorems given in the paper. As 
stated, we assume that the body of each static law ® is not an empty set and Q ^ § for 
every planning problem (T>, X, Q). 

We begin with a lemma about the operator Civ that will be used in these proofs. We 
need the following definition. Given a domain description T>, for a set of literals a, let 

r(cr) = a U {I | 3if(2, ip) e V such that ip C a}. 

Let r°(cr) = r(cr) and r i+1 (» = r(P(cr)) for i > 0. Since, by the definition of T, 
for any set of literals a 1 we have a' C T(a'), the sequence (T l (a))°l Q is monotonic with 
respect to the set inclusion operation. In addition, (T l (a)) is bounded by the set of fluent 
literals. Thus, there exists o- llmlt such that a v lmlt = U^Q^^a). Furthermore, cr v lmlt is 
unique and satisfies all static causal laws in T>. 

Lemma 1 

For any set of literals a, we have <7^, lmlt = Clx>(a). 
Proof 

By induction we can easily show that T l (a) C Cl-oip) for all i > 0. Hence, we have 

a v imit C Cl v {a) 

Furthermore, from the construction of r*(cr), it follows that a llmlt satisfies all static causal 
laws in V. Because of the minimality property of Clx>[p), we have 

Accordingly, we have 

crp mlt = CItj(<j) 

□ 

The following corollary follows immediately from the above lemma. 
Corollary 6.1 

For two sets of literals a C a' , CZp(cr) C Cfe(cr'). 

For an action a and a state s, let e(a, s) = Clx>{E(a : s)). We have the following lemma: 
Lemma 2 

Let a be an action and s, s' be states. Then, we have 

Clv(E(a, s) U (s n s')) = Cl v (e(a, s) U (s n s')) 
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Proof 

Let 7 = E(a, s) U (s D s') and 7' = e(a, s) U (s Pis'). As 7 C 7', it follows from Corollary 
16. ll that to prove this lemma, it suffices to prove that 

ci v {i) C Clv{l) 

It is easy to see that 

7' = Cl v {E(a, s)) U (s n s') C Cl v {E{a, s) U (s n s')) = C7x>(7) 
Therefore, by Corollarv l6.ll we have 

Cte(7') C Clv{Clvh)) = Clv{l) 

Proof done. □ 

Proof of Proposition^ 

Lemma 3 

For every state s' E Res v (a, s), we have 

s' \ (e(o, s) U (s ("1 s')) C pc(o, 5) 

Proof 

Let g denote e(a, s) U (s n s'). By Corollary 16. II since e(a, 5) C e(a, s) C cr, we have 

Cl v {e{a, 8)) C C7 p (cr) = s' (46) 
We now show that, for every i > 1, 

r^xr-^cpc^S) (47) 

by induction on i. 

1. Base case: i = 1. Let / be a literal in r 1 (cr) \ r°(cr). We need to prove that / 6 pc x {a, 8). 
By the definition of T, it follows that 

I £ r°(cr) = cr (48) 

/er 1 (CT)c s / (49) 

and, in addition, there exists a static causal law 

if 

in D such that 

<^Cr°(cr)=Cr (50) 

By ( 1481 . we have I ^ (sfl s'). By d49> . we have 2 € s'. Accordingly, we have I ^ s. On the 
other hand, because 5 C s, we have 

2^5 (51) 

It follows from ( 1501 that <p C s' since <r C s'. Because of the completeness of s', we have 
-up PI s' = 0. On the other hand, by ( I46i . we have C7x>(e(a, 5)) C s'. As a result, we have 

^nCi c (e(a t i)) =0 (52) 
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We now show that p % s. Suppose otherwise, that is, ip C s. This implies that I G s. By 
i49i , it follows that I G (s fl s') C a and this is a contradiction to (I48> . Thus, cp <2 s. 
On the other hand, we know that ip C a = e(a, s) U (s (~l s') and thus we have y> n (e(a, s) \ 
s) 7^ 0. In addition, it is easy to see that e(a, s) \ s C e(a, s)\S C pc°(a, 5). Therefore, 
we have 

^npc°(a,(5) / (53) 

From ( 15 1> - ( I53> . and by the definition of pc 1 (a, S), we can conclude that I G pc 1 (a, 5). 
The base case is thus true. 
2. Inductive Step: Assume that (147 > is true for all i < k. We need to prove that it is true for 
i = k + 1. Let I be a literal in r fe+1 (cr) \ r fe (cr). We will show that / G pc k+1 (a, S). 
By the definition of T, there exists a static causal law 

If (Z,p) 

in P such that 

<^cr fc (CT)c s ' (54) 

Because ^ C s', we have -tip D s' = 0. In addition, by (I46> . Cl-p(e(a, 5)) is a subset of s'. 
A a result, we have 

n^nCIi,(e(a,iS)) = | (55) 

It is easy to see that ip <2 Y k ~ 1 (cr) for if otherwise then, by the definition of T, I must be in 
r fc (<r), which is impossible. In other words, there exists I' G p> such that /' r 1 (o") but 
I' G r fc (cr). By the inductive hypothesis, we have I' G pc k {a, 5), which implies that 

pn P c k (a,5) ^ (56) 

Because I ^ r fc (er), we have / ^ a. As a result, I ^ (sfl s'). On the other hand, since 
I g r fe+1 (cr) C s', it follows that I g s. Thus, we have 

I $ S (57) 

From (|55} - d571 . and by the definition of pc fc+1 (a. <5), it follows that Z G pc fc+1 (a, J). So 
the inductive step is proven. 

As a result, it is always the case that ( 1471 holds. Hence, we have 

r i (a)\*c\J(pc>(a,6))=pc i (a ) 8) 

3=0 

and thus, 

oc oo 

\J(r(a)\a)c\Jpc\a,5) 

i=0 i=0 

Accordingly, by Lemma {1} and by the definition of pc{a, 8), we have 

(s' \ a) C pc(a, <5). 

The lemma is thus true. □ 
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We now prove Proposition^ Let 

j = e(a,6)U(5\^pc(a,S)) 8' = Cl v (~/) 

Let s' be some state in Res^-,(a, s). Such an s' exists because T> is consistent. By Lemma 
|2]and by Definition|2] we have 

s' = Cl v (a) (58) 

where 

a = e(a,s) U (s n s') 

To prove Proposition^ it suffices to prove that 5' C s' . But first of all, let us prove, by 
induction, the following 

P( 7 ) C s' (59) 

for every integer i > 0. 

1. Base Case: i — 0. Assume that I G r°(7) = 7. We need to show that Z G s'. There 
are two possibilities for Z G 7. 

a) I G e(a, 5). It is easy to see that Z G s' because 

e(a, 6) C e(a, s) C er C CId{o) = s . 

b) Z g" e(a. 5), Z G <5, and -iZ g" pc(a, 5). Since i C s, we have Z G s. Because of 
the completeness of s, it follows that -1/ g" s. Accordingly, we have 

-.Z £ (s n s') (60) 

On the other hand, because ^Z g" pc(o,J), ->Z ^ s, and (e(a, s) \ s) G 
pc°(a, 5) C pc(a, 5), we have 

-nZ£e(a,s) (61) 

From ( I60t and i61i . it follows that -^1 £ a. In addition, since ->Z g" pc(a, (5), 
by Lemma|3l we have ->Z ^ s' \ o\ Accordingly, we have ^Z ^ s'. Because s' 
is complete, we can conclude that I G s'. 

2. Inductive Step: Assume that d59l is true for all i < k. We need to show that 
p fc +!( 7 ) c s'. Let I be a literal in r fe+1 (7). By the definition of r fe+1 (7), there are 
two possibilities for Z: 

a) Z G r fc (7). Clearly, in this case, we have I G s'. 

b) there exists a static causal law 

if(Z,v) 

in T> such that i^cr' (7), 

By the inductive hypothesis, we have <p C s' . Hence, I must hold in ,s'. 
Therefore, in both cases, we have I G s' . This implies that T fc+1 (7) C s' . 
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As a result, d59t always holds. By Lemma[2 we have 

8' = \Jr( 7 )c s > 

i=0 

Since s' is a state, 5' is consistent. Thus, by the definition of the i?es-function, we have 

Res v (a,8) = {8'} 

Furthermore, 8' C s' for every s' £ Res^(a, s). 
The proposition is proven. 

Proof of Proposition^ 

Since 8 is valid, there exists a state s such that 8 C s. 

On the other hand, we assume that in every state of the world, exactly one literal in 9 
holds, there exists a literal g £ 9 such that g holds in s and for all g' e 6\ {g}, g' does not 
hold in s. 

Accordingly, we have 8 U {g} C s. By Corollary 16. II we have 8' — Clx>{8 U {g}) C 
Clx>{s) = s. Hence, 5' is consistent. By the definition of the Res— function, we have 
8' G Res-p(a, 8). Since 8' C s, <5' is a valid a-state. 

The proposition is thus true. 

Proof of Proposition^ 

Let us prove this proposition by using structural induction on p. 

1. p = []. Trivial. 

2. p — [a; q], where q is a conditional plan and a is a non-sensing action. 
Assume that Proposition's true for q. We need to prove that it is also true for p. 
Suppose $(p, 8) _L. Clearly we have $(a, 8) ^ _L. 

Therefore, we have <£>(a, 8) = i?esp(a, <$). On the other hand, since 8 is a valid a- 
state, it follows from Proposition Q that Resx>{a 1 8) = {8'} for some valid a-state 
8'. 

As a result, we have (5') contains at least one valid a-state. Hence, 8) ^ _L 
contains at least one valid a-state. 

3. p = [a; cases({gj — » Pj}™ =1 )], where a is a sensing action that senses gi, . . . , g n . 
Assume that Proposition|5]is true fork's. We need to prove that it is also true for p. 
Because 8) ^ _L, we have <l>(a, 8) ^ _L. By the definition of the <&-function, we 
have $(a, 8) = Res-p(a, 8). As 8 is valid, by Proposition^ Resxi{a : 8) contains at 
least one valid a-state 8'. 

By the definition of the Res— function for sensing actions, we know that 8' = 
CI-d(8 U {gk}) for some k. This implies that holds in 8'. 
By the inductive hypothesis, we have <t(pk, 8') contains at least one valid a-state. 
By the definition of the ^-function, we have $(pfc,<5') C 8). Thus, $(p, 8) 
contains at least one valid a-state. 



ASP with Sensing Actions, Incomplete Information, and Static Causal Laws 41 



Proof of Proposition^ 

Let n denote the size of V. Because of Lemma \l\ we can conclude that for any set of 
literals a, computing Cl-p(a) can be done in polynomial time in n. 

Observe that for a non-sensing action a and an a-state 5, computing e(a, S) and pc(a, 5) 
can be done in polynomial time in n. Thus, computing $(a, S) can be done in polynomial 
time in n. 

Likewise, computing $(a, S) for a sensing action a can also be done in polynomial time 
in n. 

Hence, Proposition|4]holds. 

Proof of Theorem Q 

The proof is similar to the proof of Theorem 3 in (Bar al et al. 20 00a I which states that the 
conditional planning problem with respect to the 0-approximation in iSon and Baral 200TI 
is NP-complete. Membership follows from Corollarv l2.ll Hardness follows from the fact 
that the approximation proposed in this paper coincides with the 0-approximation in ISon and Baral 206"Tt 
i.e, the conditional planning problem considered in this paper coincides with the planning 
problem with limited-sensing in ( B aral et al. 2000al which is NP-complete. By the restric- 
tion principle, we conclude that the problem considered in this paper is also NP-complete. 

Appendix B - Proofs related to it 

This section contain proofs related to the correctness of tt. Before we present the proofs, 
let us introduce some notations that will be used throughout the rest of the appendix. Given 
a program n, by lit(TV) we mean the set of atoms in n. If Z is a splitting set for n and S is 
a set of atoms then by &z(n) and e^(n \ bz(H), S), we mean the bottom part of n w.r.t. 
Z and the evaluation of the top part w.r.t. (Z, S) (see (Lifsc hitz and Turner 1994i for more 
information about these notions). 

Lemma 4 

1 . Let n be a logic program. Suppose n can be divided into two disjoint subprograms 
Hi and H2, i.e., n = Hi U H2 and lit(Hi) D lit(H2) = 0- Then S is an answer set 
for n if and only if there exist two sets Si and S2 of atoms such that S = Si U S2 
and Si and S2 are answer sets for Hi and n2 respectively. 

2. The result in Item 1 can be generalized to n disjoint subprograms, where n is an 
arbitrary integer. 

Proof 

The first item can easily proved by using the splitting set Z = lit(Hi). The second item 
immediately follows from this result. □ 

Proof of Theorem 

Suppose we are given a planning problem instance V = (T),2,Q) and ir^ w (V), where h > 
1 and w > 1 are some integers, returns an answer set S. The proof is primarily based on 
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the splitting set and splitting sequence theorems described in (Lifsch itz and Turner 1994). 
It is organized as follows. We first prove a lemma related to the closure of a set of liter- 
als (Lemma|3- Together with Lemma0] this lemma is used to prove some properties of 
T^h,w{V) (Lemmas |6j0&|8|l. Based on these results, we prove the correctness of 7r/ tjU ,('P) 
in implementing the $ and $ functions (Lemma|9]& LemmafTol. Theorem|2]can be derived 
directly from Lemma ITOl 

Recall that we have made certain assumptions for action theories given to ASCP: (a) for 
every k-proposition determines (a, 77), r\ contains at least two elements; and (b) for every 
static causal law if(/, (j>), <fi is not an empty set. 

The following lemma shows a code fragment that correctly encodes the closure of a set 
of literals. 

Lemma 5 

Let i and k be two integers greater than 0, and a; be a 3-ary predicate. For any set a of 
literals, the following program 

x(l, i, k) <— (I G a) 

x(l,i,k) <— x((p,i,k) (if(Z, (p) e V) 

has the unique answer set {x(l, i, k) \ I G Clj)(a)}. 
Proof 

By the definition of a model of a positive program, it is easy to see that the above program 
has the unique answer set {x(l, i, k) \ I 6 Ox, lmlt } = {x(l,i, k) \ I € Clv(c)} (see Lemma 
□ 

Before showing some lemmas about the properties of 7Tft, u) ('P), let us introduce some no- 
tions and definitions that will be used throughout the rest of this section. We first define 
some sets of atoms which will frequently be used in the proofs of Lemmas [5] - and 
Theorem|2 Then we divide the program 7r/ l tu ('P) into small parts to simplify the proofs. 
In particular, nh,w(V) is divided into two programs 7r£ W (V) and 7r£ W (V). The former 
consists of normal logic program rules while the latter consists of constraints in TTh w (V). 
Then we use the splitting set theorem to remove from 7r£ (V) auxiliary atoms such as 
fluent(. . .), literal(. . .), time(. . .), path(. . .), etc. The resulting program, denoted by 
7To, consists of "main" atoms only. We then use the splitting sequence theorem to further 
split 7To into a set of programs tt^s. Intuitively, each 7^ corresponds to a "cut" of tto at 
time point i. Finally, each 7^ is divided into disjoint subprograms 7r^'s, each of which, 
intuitively, is a "cut" of 7^ at a specific path. 

For 1 < i < h + 1 and 1 < k < w, let Ai^ be the set of all the atoms of the form 
occ(a,i,k), poss(a,i,k), used(i,k), goal(i,k), holds(l,i, k), br(g,i,k,k') (k' > k), 
e(l, i, k), pc(l, i, k), i.e., 

A-i,k — {occ(a,i,k)iPoss(a,i,k) \ a G A} U 

{holds(l 7 i, k), e(l, i, k),pc(l, i, k) \ I is a literal} U 
{br(g, i, k, k') g is a sensed-literal, k < k' < w} U 
{used(i,k),goal(i,k)} (62) 
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and let 

w h+1 

A l= \jA hk , A=\jA l (63) 

k=l i=l 

For a set of atoms SCi and a set of predicate symbols X, by E x we denote the set of 
atoms in S whose predicate symbols are in X and by Si we mean {/ | holds(l,i,k) G 

Observe that nh,wCP) can be divided into two parts (1) 7r ? * t(J ('P) consisting of normal 
logic program rules, and (2) 7r£ tu ('P) consisting of constraints. Since S is an answer set 
for TTh,wCP) U > S is also an answer set for 7r£ W (V) and does not violate any constraint in 

Let V be the set of atoms in irh, w (P) whose parameter list does not contain either the 
time or path variable. Specifically, V is the following set of atoms 

{fluent(f), literal(f), literal(-<f), contrary(f, ->/), contrary^/, f) \ f G F} U 
{sensed(g) \ 3determines(a, 9) e V.g e 9} U {action(a) \ a G A} U 
{time(t) | f G U {timel(i) | f G {l..h + 1}} U {path(p) | p G {l..tu}} (64) 

It is easy to see that V is a splitting set for ir^ W (V). Furthermore, the bottom part byi^ W CP)) 
is a positive program and has only one answer set X n = V. The partial evaluation of the 
top part of 7r£ w (V) with respect to X , 

no = ev(n* h , w (V)\b v (n* hiW (V)),X ), 

is the following set of rules (the condition for each rule follows that rule; and, by default t 
and p are in ranges 1 . . . h and 1 . . . w unless otherwise specified): 

holds(l, 1,1) <- (65) 
(initially(Z) G T) 

poss(a,t,p) <— holds(ip,t,p) (66) 
(executable(a, ip) G V) 
e(l,t, p) <— occ(a,t,p),holds(<p,t,p) (67) 
(causes(a, I, (f>) G V) 
pc(l,t,p) occ(a,t,p),not holds(l,t,p),not holds(cfi,t,p) (68) 

(causes(a, Z, </>) G P) 

br(g,t,p,p) | ... 

| br(g,t,p,w) <- occ(a,t,p) (69) 
(determines(a, 6») eV,g e 9) 
pc(l,t,p) not holds(l,t,p),pc(l' ,t,p),not e(-«p,t,p) (70) 

(if&vOeXM'ep) 

e(Z,i,p) <- e(ip,t,p) (71) 

11 Recall that at at the beginning of this section, we state that 71-^ (P) returns 5 as an answer set. 
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holds(l, t,p) 

goal(t,p) 

goal(t,p) 

holds(l, t+l,p) 
holds(l, t+l,p) 
used(t+l,p) 

holds(g, t+l,p) 

holds(l, t+l,p) 

occ(ai,t,p) \ ... 
occ(a m ,t,p) 
used(l, 1) 
used(t+l,p) 



(if(i,¥>)eX>) 
holds(ip, t,p) 

(if(/,v) eV,l <t < h + l) 

holds(Q, t,p) 

(1 < t < h + l) 

holds(f, t,p),holds(-if, t,p) 

(l<t<h + l) 

e(l,t,p) 

h(l, t,p), not pc(->l, t,p) 
br(g,t,p!,p) 

(Pi < P) 
br(g,t, Pl ,p) 

(Pi < P) 

br(g,t,pi,p),holds(l,t,p 1 ) 
(Pi < P) 

used(t,p),not goal(t,p) 



used(t,p) 

And 7r£ w (V) is the following collection of constraints 

<— occ(a,t,p),not br(9,t,p,p) 



(determines(a, 9) e T>) 

occ(a,t,p),br(g,t,p,p 1 ),br(g,t,p,p 2 ) 
(determines(a, 6) gD, g e 6,p < p\ < p 2 ) 
occ(a,t,p), holds(g,t,p) 
(determines(a, 6) e V,g e 6) 

used(h+l,p), not goal(h+l,p) 
br(g 1 ,t,p 1 ,p),br(g 2 ,t,p 2 ,p) 

(Pi <P2<P) 

br(g 1 ,t,p 1 ,p),br(g 2 ,t,pi,p) 

(.91 # .92, Pi < p) 
br{g,t,p l7 p),used(t,p) 

(Pi < P) 

used(t,p), not goal(t,p), occ(cii, t,p), occ(aj,t,p) 

(1 < i < j < m) 

occ(a, t,p), not poss(a,t,p) 



(72) 

(73) 

(74) 

(75) 
(76) 
(77) 

(78) 

(79) 



(80) 
(81) 
(82) 



(83) 

(84) 

(85) 

(86) 
(87) 

(88) 

(89) 

(90) 

(91) 
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Note that choice rules of the form 

l{Li, . . . ,L n }l <- Body 

have been translated into 

Li | . . . | L n <- Body 

and 

<— Body, Li, Lj (1 < i < j < n) 

By the splitting set theorem, there exists an answer set So for 7r such that S = So U -X" . 
Let J7j be the set of atoms in n whose time parameter is less than or equal to i, i.e., 

i 

Ui = |J Aj (92) 

It is easy to see that the sequence (Ui)^ is a splitting sequence for 7r . By the splitting 
sequence theorem, since So is an answer set for n , there must be a sequence of sets of 
literals {X,}^ such that X t QU t \ U l - 1 , and 

• S = U-=i 1 ^ 

• Xi is an answer set for 

jri=6^(7ro) (93) 

• for every 1 < i < h + 1, Xi is an answer set for 

*i = e Ui (b Ui {n )\b Ui _ 1 {* ), |J X t ) (94) 

i<t<»-i 

Given a set of atoms E, consider rules of the following forms: 

ftoWs(Z, 1,1) <- (95) 
(initially(Z) e X) 

poss(a,t,p) <— holds (ip,t,p) (96) 
(executable(a, tp) e V) 

e(l,t,p) <— occ(a,t,p),holds((j),t,p) (97) 
(causes (a, /, 0) e V) 
pc(l,t,p) <— occ(a,t,p),not holds{l,t,p), 

not holds (-*fi,t,p) (98) 
(causes (a, I, cf>) e V) 

br(g,t,k,p) \ ... 

br{g,t,k,w) <— occ(a,t,p) (99) 
(determines (a, 9) e V, g e 9) 

pc(l,t,p) <— not holds(l,t,p),pc(l' ,t,p),not e(^ip,t,p) (100) 

{tt{l,<p)eV,l'e<p) 
e(/,t,p) <- e(p,t,p) (101) 
(if(J,p)eI>) 
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holds(l, t,p) 

goal(t,p) 
goal(t,p) 
holdsQ, t,p) 

holds(l,t,p) 

used(t,p) 

holds{g, t,p) 

holds(l,t,p) 



occ(ai,t,p) \ ... 
| occ(a m ,t,p) 
used(l, 1) 
used{t,p) 



holds(ip, t,p) 
(if(l,¥>)ei>) 

holds{Q, t,p) 

holds(f, t,p), holds(-if, t,p) 
(e(l,t-l,p) G S) 

(holds(l,t-l,p) G E,pc(-./,i-l,p) £ E) 

(3(g,p').p' <pAbr(g,t-l,p',p) G E) 

(3( 5 ,p')y<PA6r( g) t-l,p',p) GE) 

3(ff,j/).j/<pA&r(ff,t-l,j/,p) G E A 
holds(l,t-l,p') G E) 

used(t,p), not goal(t,p) 
(used{t—l,p) G E) 



(102) 

(103) 
(104) 
(105) 

(106) 

(107) 

(108) 

(109) 



(HO) 
(111) 

(112) 



Then for each i G {1, . . . , /i + 1}, 7T; can be divided into w disjoint subprograms irf , 
X < k < w, where 7r,f is defined as follows 



ll|95|l - ifllMfl . l(TT0|) - ifTTT Jl \t. = l.n=n ifi = l,k=l 

I© - (US}, EH | i = l,p= k} if i = 1, > 1 

(l|96) - <fTT0f) . <fTT2f) |t = it.w = fc.S = X_iT ifl<i</i 

{JT02j) - fT09l) . lfTT2|) 1 1 = h + l,p = fc,E = X h } otherwise 



(113) 



Let Xi^ denote n Aj fc. From Lemma|4l it follows that is an answer set for 71$. 
Hence, we have 

5i,k{S) = S hk (S ) = Si^ k {Xi) = Si t k(Xi t k) 

Due to this fact, from now on, we will use di t k to refer to either Sik(S), Si.k(So), <Si,fc(-Xj), 
or Si^(Xi^). 

We have the following lemma 

Lemma 6 

For 1 < i < h + 1 and 1 < k < w, 



1. if used(i,k) $ S then S does not contain any atoms of the forms holds(l,i,k), 
e(l,i,k), br(g,i,k,k); 
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2. if used(h + 1, fc) G S and 8h+i,k is consistent then 

Sh+i, fe h 

Proof 

1 . We will use induction on i to prove this item. 

a. Base case: i = 1. Let fc be an integer such that used(l, fc) G" S. Clearly we have 
fc > 1. On the other hand, it is easy to see that (by using the splitting set Z = 

j^{hoids,e,br,occ,goai,used}^ jf £ > j tnen g does not contain atoms of the forms 

holds(l, 1, fc), e(l, 1, fc), and br(g, 1, fc, fc). Thus, the base case is true. 

b. Inductive step: Assume that Item 1 is true for i < where j > 1, We will prove 
that it is also true for i = j. Let fc be an integer such that used(j, fc) S. 
Clearly, to prove Item 1 we only need to prove that atoms of the forms e(l,j, fc), 
holds(l,j, fc), br(g,j, fc, fc) do not belong to Xj^, Consider the program in (see 
( II 13» . We know that Xj^ is an answer set for 7rj\ 

Because of rule (II 12> . we have used(j—l, fc) G" Xj-x. From dl07l . it follows that 
br(g, fc', fc) Xj-i for every pair (g, fc') such that fc' < fc. In addition, by the 
inductive hypothesis, we have that for any / and g, e(l,j—l, fc), holds(l,j—l, fc), 
and br(g,j—l, fc, fc) are not in Xj-x. As a result, rules (I105> - (I109> do not exist in 

7T*. If We split 7r| by the Set Z = A {holds,e,br,ooo,used, g oal} ^ ^ ^ ^ Qf 

rules of the forms 

i. 03, (H, flPp-floD, JTTQl if i < h 

ii. ltl02l-llT03t if % = h + 1 

It is not difficult to show that this program has the empty set as its only answer set 
(recall that Q ^ 0). From this, we can conclude the inductive step. 

2. It is obvious because of the rules f73l . (74} and the constraint ( l8oY 

□ 

Lemma 7 

For 1 < i < h and 1 < fc < w, if occ(a, i, fc) G S* then a is executable in 5$ & and there is 
nob a such that occ(b, i, fc) e S. 

Proo/ 

From constraint ( 19 H . it follows that poss(a, i, fc) G 5. Notice only rules of the form (I66> 
may have poss(a, i, fc) as its head. Hence, there must be a proposition (|2j in X> such that tp 
holds in 6^, This means a is executable in So-- 

If there exists b ^ a such that occ(fe, i, fc) G 5 then constraint d90l could not be satisfied. 

□ 

Lemma 8 

for 1 < i < h and 1 < fc < w 

1. if occ(a, i, fc) G S* and a is a non-sensing action then 

a. e(7, i, fc) G S 1 iff ? G e(a, 5^^) 
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b. pc(l, i, k) G S iff I G pc{a, S^k) 

c. -,3{g,k').br{g,i,k',k)&S ' 

2. if occ(a, i,k) € S and a is a sensing action a with occurring in a k-proposition of the 
form (0 in X? and 9 = {gi, . . . ,g n } then there exist n distinct integers fci, . . . , k n 
greater than or equal to k such that 

a. = {br{g j ,i,k,k j )\j&{l : ...,n}} 

b. gj does not hold in 6^, 

c. if kj > k then 5* does not contain any atoms of the form holds(l, i,kj) 

3. if occ(a, i,k) $ S for every action a then 

a. \/l.pc(l, i, k) £ S A e(Z, i,k) S 

b. V(g,k').br(g,i,k,k')?S 

Proof 

Let us split 7r 2 fc by the set Zi = ^{^ sed >9 oai > occ '' io ' d;s >? ,oss )^ gy me splitting set theorem, 
Xi.k — M U N where M is an answer set for bz 1 {nf) and N is an answer set for III = 
ezi C^i \ &Zi (^i )i A^)> which consists of the following rules 

e(l,i,k) «- (114) 
(occ(a, i,k) e M, causes (a, l,<fi) €V, 
holds(4>,i,k) C M) 
pc(l,i,k) <- (115) 
(occ(a, z, fc) G Af, holds{l, i, k) M, 
causes(a, Z, 0) 6 P, holds{-^cj>, i, k) n M = 0) 

br(g,i,k,k) \ ... 

br(g,i,k,w) <- (116) 
(occ(a, i, fc) G M, determines (a, 9) <E V, g e 9) 

pc(l,i,k) pc(l',i,k),not e(-Kp,i,k) (117) 
(if(Z, <p) G X>, /ioZds(Z, i, fc) g M, /' G (ys) 
e(Z,i,A;) <- e((p,i,k) (118) 
(if(Z^)eD) 

From the splitting set theorem, it follows that Si^(M) = Si.k 

1. Assume that occ(a, i,k) £ S and a is a non-sensing action. By Lemma0 we know that 
there exists no sensing action 12 b such that occ(b, i, k) G 5. This means that rules of form 
M 16ft does not exist. Therefore, III can be rewritten to 

e(l,i,k) <— 

(causes(a, 1, 4>) G V, holds(4>, i, k) C M) 

12 Recall that the sets of non-sensing actions and sensing actions are disjoint from each other. Hence, a itself is 
not a sensing action. 
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pc(l,i,k) <— 

(holds(l, i, k) M, causes(a, l,(f>) eV, 
holds(^(j), i, k) n M = 0) 
pc(l,i,k) <— pc(l , i, k), not e(-ttp, i, k) 

(iS(l, <p) G V, holds(l, i, k) <£ M, I' G <p) 
e(l,i,k) <— e(ip,i,k) 

(if(/ )¥ >)eD) 

If we continue splitting the above program using Z2 = then by Lemma|5j the bottom 
part has the only answer set 

{e(l,i,k) I I G e(a,<5 l!fc )} 
and the evaluation of the top part has the only answer set 

{pc(l,i,k) I I G pc(a, <5 ijfc )} 

Due to the fact that M does not contain any atoms of the form e(l, i, k) or pc(l, i, k), we 
therefore can conclude Items (a) and (b). 

We now show that ->3(<7, k') .br(g,i, k' , k) G 5. Suppose otherwise, i.e., there exists g 
and fe' such that br(g, i, k' , k) G 5. Notice that only rule (I80i with f = « and p = k has 
occ(a, z , k) in its head. Hence, its body must be satisfied by S. That implies used(i,k) G 5. 
On the other hand, since only rules of the form (I69> with p = k' may have br(g,i,k' ,k) in 
its head, there exists a sensing action b such that occ(b, i, k') G 5 and in addition, fc' < fc. 
As the sets of non-sensing actions and sensing actions are disjoint from each other, we 
have b ^ a. From Lemma0 it follows that k' < k. 

Accordingly, we have used(i, k) G S, br(g, i, k' , k) G S and k' < k. Constraint d89l with 
t = i, p = k, and p\ = k' is thus violated. Thus, Item (c) holds. 
2. Assume that occ(a, i,k) G S and a is a sensing action occurring in a k-proposition of the 
form <0 in X> with6> = {gi, . . . , <?„}. 

In this case, since rules of the forms (II 14i and il 15i do not exist, ITi is the following set 
of rules 

br(g 1 ,i,k,k) | ... 
br(gi,i,k,w) <- 

br(g n ,i,k,k) \ ... 
br(g n ,i,k,w) <- 

pc(l,i,k) *— pc(l' ,i,k),not e{-tip,i,k) 

(if (Z, g O, holds(l, i, k) <£M,l' G ^) 
e(l,i,k) i— e(ip,i,k) 

(if(Z,¥>)eX>) 

By further splitting the above program using the set J 4|^ pc ^, we will see that the bottom 
part has the empty set as its only answer set (recall that we are assuming that the body 



50 



Phan Huy Tit, Tran Cao Son, and Chitta Baral 



of each static law of the form @ is not empty). Therefore, the answer set for the above 
program is also the answer set for the following program and vice versa. 

br(gi,i,k,k) | ... 

br(g 1 ,i,k,w) <- 

br(g n ,i,k,k) | ... 

br(g n ,i,k,w) <- 

Thus, there exist n integers fci, . . ., k n greater than or equal to fc such that 

n 

N= {Jibrig^^k^j)} 
j=i 

It is easy to see that XfZ^ = N^ br K In addition, by constraints of the form J88i . kj's must 

be distinct. Thus, Items (a) is true. 

Item (b) can be drawn from constraints of the form J85t . 

Assume kj > k. Because of constraints of the form (j89jl, we have used(i, kj) ^ S. From 
Lemma|6j it follows that S does not contain any atoms of the form holds(l, i,kj). Item (c) 
is thus true. 

occ(a, i,k) £ S for every action a. In this case, ITi is the following set of rules 

pc{l,i,k) <— pc(l',i,k),not e(^ip,i,k) 

(if(7, <p) S V, holds(l, i, k) M, I' G ip) 
e(l,i,k) <— e(tp, i, k) 

which has an empty set as its only answer set. Items (a)-(b) follow from this. 
□ 

The following lemma shows that TVh^i'P) correctly implements the transition function $. 
Lemma 9 

For 1 < i < h and 1 < k < w 

1. if there exists a non-sensing action a such that occ(a, i,k) e S then 

_ , - v f if (5,+i fc is inconsistent 

$(a,5 i>k ) = i l+L ' K ; 

L \Oi+i,k\ otherwise 

2. if there exists a sensing action a occurring in a k-proposition of the form in V with 
9 = {<7i, . . . , g n } such that occ(a, i,k) £ S then there exist n integers {ki, . . . , k n } 
such that 

®(a,8i,k) = {Si+i.kj | 1 < j < n,5 l+1 . kj is consistent}, 

and for each j, gj holds in Si+i^kj ; 

3. if occ(a, i,k) S for every action a, 
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Proof 

1. Assume that there exists a non-sensing action a such that occ(a, i, k) S X{. 

Observe that Z\ = A^° 1 ^ is a splitting set for 7rf +1 . Hence, by the splitting set theorem, 
Xi+i.k = M U N, where M C Z\ is an answer set for 111 = bz 1 (Tr'i +1 ) and N is an 
answer set for 1T2 = ez 1 (7rf +1 \ ITi, M). 

Notice that by Lemma[8] rules J 1 08i — dT09t for t = i + l,p = kdo not exist. Thus, 111 is 
the following set of rules: 

holds(l,i+l,k) <— holds(ip,i+l,k) 
(tf(i,V)6l») 

holds(l,i+l,k) <— 
holds(l, k) 



(e(l,i,k) e Xi) 



(holds(l, i, k) G Xi,pc(-<1, i, k) $ Xi) 

Also by Lemma [8] the conditions for the second and third rules can be written as (I 6 
e(a,Si t k)) and (/ £ At,*, - "/ ^ pc(a, tf^fc)) respectively. Thus, by Lemma[5] II 1 has the 
unique answer set 

M = {holds(l, i+1, k) \ I e C7p(a, fy k )} 

On the other hand, by Lemma[7] a is executable in Jj^. From the definition of the Res-p 
and $ functions, it follows that 



$(Mi,fe) 



if is inconsistent 

{^+1^.} otherwise 



2. Assume that there exists a sensing action a with a k-proposition of the form (|5} and 9 = 
{<?i, . . . , g n } such that occia, i, k) G S. 

By Lemma[8] for each j e {1 . . . n}, there exists fcj > k such that br(gj, i, k, kj) E Xi. 
It is easy to see that Zi = Aj^^ 8 } is a splitting set for 71"^. Considering cases kj = k 
and kj > k in turn and observe that holds (I, i, kj) $ S if fcj > fc, we will see that in both 

k ■ 

cases bz 2 (7^+1) is the following set of rules: 

holds(l,i+l,kj) <— holds{ip,i+\,kj) 
(if(^)eD) 

holds(l,i+l,kj) <— 

(holds(l,i,k) e Xi) 

holds(gj,i+l,kj) <— 
By Lemma|5] the only answer set for the above program is 

M = {holds{l,i + l,kj) \ I e Cl v (6 itk U {gj})} 

On the other hand, by Lemma0 a is executable in S^k and by Lemma|8] gj does not hold 
in Si i fe. Thus, according to the definition of the transition function, we have 

$(a, <5i, fe ) = {C/i>(5t,*! U {flfj}) I 1 < j < n, Cl v (k,k U {gj}) is consistent} 



52 



Phan Huy Tit, Tran Cao Son, and Chitta Baral 



Hence, we have 

$(a. 6 ilk ) = {Si+i,^ (M) | 1 < j < n, S l+1 . kj (M) is consistent} = 

{S i+ x >kj | 1 < j < n, 5 i+ i. k] is consistent} 

and obviously, gj holds in ■ 
, Assume that occ(a, i,k) £ S for every action a. 
Similar to the first case, Z\ is a splitting set for 7rf +1 . bz 1 (TTi +1 ) is the following set of 
rules: 

holds{l 1 i J rl^k) «— holds(ip,i+l, k) 

(if(Z^)eP) 

holds(l,i+l, k) «— 

(holds{l, i,k) G Xi) 

Because that is an a-state (Lemma|6j, by Lemma[5]the only answer set for this program 
is 

M = {holds{l, i + 1, k) \ I € S^ k } 

Thus, we have 
□ 

The following lemma shows that 717^ ('P) correctly implements the extended transition 
function. 

Lemma 10 
We have 

1. is the initial a-state for V. 

2. For every pair of integers 1 < i < h+1, 1 < k < w, if used(i, k) G S then 

a) Pi(S) is a conditional plan 

b) furthermore, if is consistent then for every 6 G <^(p^(S),Si^), S \= Q. 
Proof 

Z\ = A s d l ° lds } j s a splitting set for tt\. The bottom part, bz 1 (t^I), consists of the following 
rules: 

holds(l, 1,1) <— 

{initially(0 G 1} 

holds(l, 1,1) <— holds((p, 1, 1) 
{»(!,¥>) £D} 

By Lemma|5] the only answer set for the above program is 

M = {holds(l, 1, 1) | / G <5i} 
where 5i is the initial a-state of T 7 . Thus, = ^^(M) is the initial a-state of T 5 . 
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2. We now prove Item 2 by induction on parameter i. 

a. Base case: i = h+l. Let k be an arbitrary integer between 1 and w such that 
used(i, k) £ S. Clearly p^(S) = [] is a conditional plan. 

Now suppose that Si t k is consistent. According to the definition of the extended 
transition function, we have 

*(fc fc (S),M = *(DA*) = {*<,*} 

On the other hand, by Lemma [6] we have that Si.k \= Q. Thus, Item 2 is true for 
i = h+l. 

b. Inductive step: Assume that Item 2 is true for all h + 1 > i > t. We will show that 
it is true for i = t. Let k be an integer between 1 and w such that used(t, k) £ S. 
Consider three possibilities: 

i. occ(a, t,k) £ S for some non-sensing action a. By the definition of Pt(S), we 
have Pt(S) = [a; p% +1 (S)]. In addition, by rule d82l we have used(t+l, k) £ 
S. Thus, according to the inductive hypothesis, p^ +1 (S) is a conditional plan. 
Accordingly, p*(5) is also a conditional plan. 

Now suppose that 5^ is consistent. Consider two cases 

- 5t+i.k is consistent. We have 

6(p*(S)A,ib) = $([a;p t fc +1 (5)],<5 t , fe ) = ^(^(^^t+i.fe) 

(by Lemma|9]and by the definition of the extended transition function). 
On the other hand, according to the inductive hypothesis, for every S in 
&(Pt+i(S), #t+i,fc), <5 (= 5- Hence, the inductive step is proven. 

- #t+i,fc is inconsistent. By Lemma|5] we have $(j>j (S), 5 tl k) = 0. Thus, the 
inductive step is proven. 

ii. occ(a, t,k) £ 5 for some sensing action a with a k-proposition of the form (|5} 
and = {gi, . . . ,g n }. By Lemma [8] there exist exactly n integers k\, . . . , k n 
greater than k such that br(gj,t, k, kj) e S for 1 < j < n. This implies that 
usedlt + 1, fcj-) e S (see rules (77) and Thus, by the definition of Pt(S), 
wehavep^(5) = [a; cases ({gj —> p%i(S)} 7 j =1 )]. On the other hand, we know 

k ■ 

by the inductive hypothesis that p i H i _ 1 (S) is a conditional plan for 1 < j < n. As 
a result, Pt(S) is also a conditional plan. 

Suppose <5i.fc is consistent. Let J = { j \ St+i.kj is consistent}. By Lemma[9] we 
have 

$(o,*t,fc) = {5t+i,k, | j e J} 
and holds in ^t+i.fej for every 1 < j < n. Hence, by the definition of $, we 
have 

$( ft fe (s)A fc ) = y $(^ +1 (s),5 t+1 , fci ) 

According to the inductive hypothesis, for every 5 G $(pf +1 (S'), 8t+\,kt )> where 
j G J, we have 5 |= Q. This implies that for every 6 £ <5t.fc), we have 

s he. 
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hi. There is no action a such that occ(a, t, k) S S. According to the definition of 
Pt(S), Pt{S) — []. Hence, it is a conditional plan. 

It is easy to see that goal(t, k) 6 S, which means that either 8t,k is inconsistent 
or 8t,k \= Q (see rules d73i . J74L and d80» . Now suppose that & t ,k is consistent. 
This implies that St k \= Q- We have 

% t fe (S),£ t , fc ) = $([], 6t, k ) = {S t , k } 

Thus, the inductive step is proven. 

□ 

Theorem|2]immediately follows from Lemma ITOl 

Proof of Proposition^ 

First, we prove the following lemma. 
Lemma 11 

Let V = (T>, J, Q) be a planning problem instance, 8 be an a-state and p be a plan. If 
J) |= £ then $(reduct«5(p), 8) (= 

Let us prove the lemma by structural induction on p. 

1. p =\\. 

The proof is trivial since reduct d -(p) = p = Q. 

2. Assume that p = [a; q], where q is a conditional plan and a is a non-sensing action and 
the lemma is true for q. 

Suppose $(p, 8) \= Q. We need to show that $(reduct,5(p), 8) f= Q. 
If 8 f= £then 

$(reduct 5 (p),<J) =&(W,S) = {S}\=g 
Now consider the case that 8 ^= Q. 

Clearly, we have <&(a, 8) / _L. Therefore, $(a, <5) = {8'} for some <5'. Hence, by the 
definition of reduct, we have 

reduct^p) = a; reduct^/ (q) 

Thus, 

$(reducta(p), S) = i>(reduct,y (q), 8') 
On the other hand, we have 

$(p,8)=$(q,8') 

Because <l(p, <5) |= Q, we have 

By inductive hypothesis, we have 

$(reducV(g),<5') \= Q 
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Hence, 

$(reduct,5(>),<5) |= Q 

. Assume that p = [a; cases({gj — > Pj}™ =1 )], where a is a sensing action that senses 
gi, . . . , g n , and the lemma for pj 's. 

Suppose $(p, 5) \= Q. We need to show that $(reduct,5(p), 5) \= Q. 
If 5 \= £then 

<l(red U ct 5 (p),<5) - $([],<5) - {6} \= Q 
Now consider the case that 5 \/= Q. There are two possibilities. 

a) there exists gk such that gk S 8. By the definition of reduct, we have 

reduct^p) = reduct^pfc) 
By the definition of the ^-function, it is easy to see that 

$(P,5) = $(pk,S) 

Since <l(p, 5) \= Q, we have <&(pk, S) \= Q. By the inductive hypothesis, we have 

$(reduct«5(p fc ),<5) |= Q 

Hence, we have 

l>(reduct,5(p),<5) |= Q 
a) for every 1 < j < n, gj £ 5. By the definition of reduct, we have 

reduct^p) = a; cases({#j -» qj}] =1 ) 

where 

if Clj>(S U {gj}) is inconsistent 

1j 



{ 



reduct C j I ,( 1 5u{g J })(Pj) otherwise 

For every 1 < j < n, let 5j = Clx>{5 U {ft})- Let J = { j \ Sj is consistent}. It is 
easy to see that 

#(p,5)= U*(P^) 

because holds in <5j but for every fc ^ j, g k does not hold in Sj. 
Because $(p, 5) |= Q, we have 



for every j G J. 

On the other hand, we have 



D if J 2 J 

reduct^. (pj) otherwise 



Thus, 

$(reduct 5 (p),5) = (J ®{qj,6j) = [j $(reduct 5j . (p.,), <*j) 
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By the inductive hypothesis, for every j G J, as &(pj, Sj) \= Q, we have ^(reduct^ (pj), 5j) \= 
Q. As a result, we have 

$(reduct a (p),£) |= Q 

□ 

We now prove Proposition^ Let p be a solution to V. From the construction of reduct, 
it is easy to see that reduct,5(p) is unique. 

By LemmafTTI we have that $(reduct^(p), 5) j= Q. Thus, reducta(p) is also a solution 
to V. 

So, we can conclude the proposition. 

Proof of Theorem^ 

The idea of the proof is as follows. Let q be reduct^p), where 5 is the initial a-state of 
V, and let T q be the labeled tree for q numbered according to the principles described in 
Section|3] Let h and w denote the height and width of T q respectively. For 1 < i < h + 1, 
1 < fc < w, we define to be the a-state at node (i, fc) 13 of T q if such a node exists and 
_L otherwise. Based on T q and S^h, we construct the set Y^k of atoms that hold at node 
(i 1 k). Then we prove that the union of these sets, denoted by Sq, is an answer set for tto 
(rules d65> - (l82» by showing that each is an answer set for a part of 7To, denoted by 
n'i . Furthermore, a set S' can be constructed from S' Q in such a way that it is an answer set 
for 7r£ w (V). Moreover, S" does not violate any constraints in 7r£ w (rules ( I83> - (I91» . As 
such, it is an answer set for 7r/ li „,('P). Moreover, q = p\(S'). 

Given the numbered tree T q , by (a, i, k) we mean the node labeled with a and numbered 
with (i, k) in T q ; by (g, i, k, k') 6 T q we mean the link, whose label is g, between the 
nodes (i, k) and (i + 1, fc') in T q . 

For 1 < i < h + 1, 1 < k < w, we define the a-state 6n- as follows. 



i. if i 



Cl v ({l | initially(Z) el}) if fc = 1 

1 iffc>l 1 ' 



if i > 1 



ClD{e{a,8i-i tk )U{8i-x tk \pc{a,8 i -\,k))) if (a,i-l,fc) e for 

a non-sensing action a 
Clv(Si-i,k' U {g}) if (g,i-l,k',k) eT q 

<5i_i fe otherwise 

(120) 

Note that given (i, fc), there exists at most one action a such that (a, k) G T q , and fur- 
thermore, at most one pair (g, fc') such that (g, fc', fc) G r g . In addition, the conditions 
in Equation (11201 do not overlap each other. Thus, S^k is uniquely defined for 1 < i < h+ 1 
and 1 < fc < w. In what follows, the undefined situation _L can sometimes be thought of 
as 0, depending the context in which it is used. 

Let us construct the set Yi^ of atoms based on 5i t k as follows. 

13 That is, the node numbered with (i, k) in T q 
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1. used(l, 1) e Yi,i 

2. holds(l, i, k) G YJ,* iif Z 6 S^k 

3. poss(a, i, k) G Y^fc iif there exists a proposition of the form s.t. ijj C <5jfc 

4. occ(a, i, fc) G Yi t k iif (a, i, fe) G T g 

5. &r(p, j, k, k') G Y^fc iif (<?, i, fc, fc') G T q for some g, fc' 

6. e(Z, i, fe) G Yi ie iif (a, i, fc) G T 9 and Z G e(a, dj^) for some non-sensing action a 

7. pc(Z, i, k) G iif (a, i, fe) G T g and Z G pc(a, for some non-sensing action a 

8. For i > 1, used(i, k) G Y^fe iif either 

(a) used(i— 1, fc) G Y*-i,k; or 

(b) there exists (g,k') s.t. (g,i— l,k', k) G Yi-\^> 

9. goal(i, k) G Y$ & iff |= 5 or 5^/. is inconsistent 
10. Nothing else in Y^k 

Clearly, Fife's are uniquely defined. Furthermore, they are disjoint from each other. Let 

w h+1 

Yi=\J Yi, k and S' =\jYi 

k=l i=l 

Lemma 12 

For 1 < i < h and 1 < k < w, let M = Y ^ olds ' poss ' 3 ° al ' used ' oc ^ and let n be the 
following program: 



e(l, i, k) 
pc(l, i, k) 



br{g,i,k,k) \ ... 
br(g,i,k,w) 



(occ(a, i, k) G M, causes(a, Z, cf>) G V, holds(cf>, i, k) C M) 

(occ(a, i, k) G M, causes(a, Z, <j>) G V, 
holds(l, i, k) ^ M, holds(^<j), i, k) n M = 0) 



(occ(a,i,k) G M, determines (a, 6) e V, g e 

pc(l,i,k) <— pc(l',i,k),not e(->cp,i,k) 

(if(Z, <p) G P, /ioZds(Z, i, k) £ M, V G y>) 
e(l,i,k) <— e(ip,i,k) 

(if(l^)ev) 

Then, = y.'<^' pc ' br } j s an answer set for II. 
Proof 

Given (i, k), there are three cases that may happen at node (i, k). 

there exists a non-sensing action a such that (a, i, k) G T 9 ; 
there exists a sensing action a such that (a, i, fe) G T q ; 
(a, i, k) T q for every action a 
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Let us consider each of those in turn. 

1. there exists a non-sensing action a such that (a, i, k) G T q . 

From the construction of Y^k, we know that occ(a, i, k) G M and there is no b ^ a such 
that occ(b, i, k) G M. Furthermore, due to the fact that N does not contain any atom of 
the form holds(l, i, k), we have holds(l, i, k) G M iff holds(l, i, k) G Yi tk - That means 
holds{l, i, k) G M iff I G <5 j; , fc . 
Hence, II can be rewritten to: 

e(l,i,k) <— 

(I G e(a,S i<k )) 

pc(l,i,k) *— 

(I epc°(a,5i, k )) 
pc{7,j,/c) <— pc(l' ,i,k),not e(^ip,i,k) 

e(l,i,k) <— e((p,i,k) 

(If(l, v )e2?) 

As have been seen in the proof of Theorem|2](see the proof of Lemma[8] Item 1), the only 
answer set for this program is {e(l, i, k) \ I G e(a, 6^)} U {pc(l, i, k) | I G pc(a, (5^)} = 
N. 

2. there exists a sensing action a such that (a, i, k) G T q . 

We have occ(a, i,k) G M and there is no non-sensing action b such that occ(b, i, k) G M. 
As a result, II is 

br(g,i,k,k) \ ... 

br(g,i,k,w) «- 

(occ(a, i, k) G M, determines (a, 0) eV,g e 9) 
pc(l,i,k) <— pc(l' ,i,k),not e(-i<p,i,k) 

(if(Z, G £>, holds(l, i, k) <£ M,l' G 
e(l,i,k) <— e(ip,i,k) 

It is easy that an answer set for IT is also an answer set for 

br(g,i,k,k) \ ... 

br(g,i,k,w) *— 

(occ(a, i, k) G M, determines (a, 0) eV,g e 9) 
and vice versa. On the other hand, 

is an answer set for the latter program. As a result, N is also an answer set for II. 

3. (a, i, k) ^ T q for every action a. 

In this case, the first three rules of IT do not exist because occ(a, i,k) M for every a. 
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Thus, II consists of the last two rules only. It is easy to see that it has the empty set as its 
only answer set. On the other hand, from the construction of Y^k, we have v'® ,pc = 0. 

Accordingly, Y^' pc ' br ^ is an answer set for II. 
The proof is done. □ 

Lemma 13 

For 1 < i < ft + 1, 1 < k < w, Y^fc is an answer set for 7r'j , where tt'^ is defined in the 
same way as 7r 2 fc except that we replace every occurrence of X in Equation J 1 1 31 by Y. 

Proof 

Let us consider in turn two cases i = 1 and i > 1. 

i = 1. It is easy to see that the only answer set for -k'\, where k > 1, is 

Y\,k = {poss(a, 1, k) | executable(a, 0) e V} 

by using the splitting set A{ h ° lds > occ ' br ' used ' e >P c } ( see (|62l for the definition of Ai.k) and 
observe that the bottom part has the empty set as its only answer set and Yi t k is the only 
answer set for the evaluation of the top part. 

We now prove that Y\\ is an answer set for n 1 ' 1 which consists of the rules of the forms 
CSl-CED, CEB-di where t = 1 andp = 1. If we use the set Z x = A V^ds,occ,poss, g oai,us 
to split tt'\ then &Zi(t'i) is 



{63-63,CSa-jng,CE3,ini) \t = i, P = i} 

of 

answer set for bz x (tt'i)- Furthermore, we have 



r-< tx. j n 'a.' r -ii ^ 7i r ^ r\ holds ,occ,poss , goal , used} . 

From the definition of Y\ t \, we can easily show that M = Y 1 \ is an 



Si A (M) = S 1A (Y 1A ) = 5 lrl 
The evaluation of the top part, ITi = ez 1 (ir'{ \ bz x (tt'i), M), is the following set of rules 

e(I,l,l) «- 

(occ(a, 1, 1) € M, causes(a, I, (f>) e V, 
holds((j>, 1, 1) C M) 



pc(l, 1,1) 



6r( 3 ,l,l,fc) | ... 

1,1, w) 



(occ(a, 1, 1) G M, causes(a, Z, 4>) e V, 
holds(l, 1, 1) £ M, holds(-^<p, 1, 1) n M = 0) 



(determines (a, 0) eU,g e 9, occ(a, 1, 1) e M) 
pc(Z,l,l) <— pc(Z', 1, 1), not e(->(p, 1, 1) 

(if (Z, <p) 6D,l'e v?j holds(l, 1, 1) £ M) 
e(/,l,l) 4- e(^,l,l) 

(tf(z,¥>)ex>) 
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By Lemma IT2l N = Y^' pc,br ' is an answer set for ITi. As a result, Y\ \ = M U N is an 
answer set for n' \ . 
2. 1 < i < h+ 1. 

Using the splitting set Z2 = j^ h ° lds - occ ^ oal ^ used 'P oss ^ [ Q split 7r'^, we have that the bottom 
part II2 = bz 2 {^'i) consists of rules of the forms 

• (|%}, (luHMTTol. and (E3 if i < ft. 

• (fT02t-(fT09t. and (11721 if i = h + 1 

We now prove that M — Y i k is an answer set for II2. Let us further 

split II2 by the set Z% — j\j h ^ lds \ Then, the bottom part bz 3 (lb) consists of rules of the 
forms (E03, (G05}-(IT06}, $UM-ftM only. 
Consider three cases 

a. there exists a non-sensing action a such that occ(a, i— 1, k) G 3^-i. 

From the construction of li fe's, it is easy to see that there exists no {g, k 1 ) such that 
br(g,i—l,k',k) G Yi-\- Thus, bz 3 (n 2 ) contains rules of the forms (I102i . J105i — 
J106i only. On the other hand, we have 

e(/, fc) G Fi_ a iff / G e(a, 

pc(-i/, k) <j£ Yi-i iff ^ pc(a, £;-i,fc) 
Hence, bz 3 (1T2) is the following collection of rules: 

holds^^i, k) <— holds(ip,i, k) 



holds(l,i,k) <— 
holds^^ijk) «— 

By Lemma|5] it has the only answer set 



(if(/,^)eP) 

(Z G e(a, <5i_i,fe)) 

(Z e 6i-i, k , ->l & <5i-i,fc) 



{holds(l,i,k) I Z G C/p(e(a,(5i_i,ife) U (£i_i,jfc \ pc(a, 6,-1, fc )))} = 

b. 3(g,k').br(9,i-l,k',k) G Y^-i. 

From the construction of y/ fc s, such (g, fc') is unique and in addition k' < k. Thus, 

&z 3 (n 2 )is 

holds{l,i,k) <— holds((p,i,k) 

GC) 

holds(l,i,k) <— 

(0 G 5i_i, fc ) V (*' < fe A i G 5i-i lfc /)) 

holds(g,i,k) <— 
or equivalently, 

holds(l,i,k) <— holds(cp,i,k) 
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(if 

holds(l,i,k) <— 

(I G Si- Xtk , U {g}) 

since if k' < k then 5i-x,k = 0. By Lemma[5] this program has the only answer set 

{holds(l, i, k) | Z G Clv(Si~i,k' U {5})} = {holds(l, i,k)\l e S it k} 

Hence, y^ oWs l j s the only answer set for (II2). 
c. occ(a,i— 1, fe) £Yi—iforeverynon-sensingactionaandV(g,k').br(g,i—l,k',k) 

From the construction of Y^fc's, it follows that e(Z, i— 1, fe) andpc(Z, i— 1, fe) g 1 
for every Z. Hence, (II2) is the following set of rules 

holds(l,i,k) *— holds((p,i,k) 
(if(l,ip)GV) 

holds(l,i,k) «— 

(Z 6 **-i,fe) 

whose only answer set is 

{holds(l,i,k) I Z G <5i_i, fc } = {holds(l,i,k) I Z G 5 iife } = Y^ olds} 

So, in all three cases, we have Y^ olds ^ is an answer set for bz 3 (II2). 
Hence, n3 = ez 3 (IL2 \ bz 3 (II2), ° Ws ^) is the following set of rules: 



poss(a,i,k) <— 



used{i, k) 
goal{i, k) 
goal(i, k) 



(executable(a, ip) G X> C ^ jfe ) 
(3( 5 , < fc, &r(ff, fc', fc) G 
(5 c * <ifc ) 

is inconsistent) 



occ(ai,i, k) I . . . 

I occ{a m , i, k) <— used{i, k), not goal (i,k) 



used{i, k) 



{used(i—\,k) G 



It is easy to see that Y^ oss ' used ' 9 ° al ' occ ^ is an answer set for Accordingly, we have 

= £ is an answer set tor II2. 

II4 = e^ 2 (77',- \ n 2 , M) is thus the following set of rules: 

e(l,i,k) <— 

(occ(a, i, fc) G M, causes(a, Z, 0) 6f, holds(cf>, i, k) C M) 
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pc(l,i,k) <— 

(occ(a, i, k) E M, causes (a. I, cf>) e V, 
holds(l, i, k) g M, holds(-np, i, k) n M = 0) 

br(g,i,k,k) \ ... 

br(g,i,k,w) <- 

(occ(a, i,k) E M, determines(a, 9) e V, g e 9) 

pc(l,i,k) <— pc(Z , z , fc) , not e(->ip , i , k) 

(if (I, ip) E P, fto/ds(/, i, fc) £ M, I' E <p) 
e(l,i,k) <— e((p,i,k) 

(if(l^)ev) 

By LemmalT2l iV = yy(f' pc ' 6T ^ i s a n answer set for II4. 
As a result, Y^k = M U N is an answer set for n 1 ^. 

□ 

Lemma 14 
We have 

1 . S" = (Ji^i Yi U is an answer set for nh,w {P), where X$ = V is defined in J64l >. 

2. j4(S')=3 

Proof 

1. Since Yi t k is an answer set for n \ and -k \ 's are disjoint from each other, we have Yi is an 
answer set for n'i, where n'i is defined in the same way as 7Tj except that every occurrence 
of X in Equations d93l and ( I94l > is replaced with Y. From the splitting sequence theorem, it 
follows that S' = [j^i Yi is an answer set for ttq. Thus, S' is an answer set for nt W (V). 
On the other hand, it is not difficult to show that S' satisfies all constraints in tt^ W (V) 
based on the following observations. 

• If occ(a, i, k) E Yi,k for some sensing action a which occurs in a k-proposition of 
the form then there exists g in 9 such that br(g, i, k, k) E Yj f-- Furthermore, for 
every g' E 9, g' does not in 5^k- The latter property holds because that q does not 
contain an action that senses an already known-to-be-true literal. 

• If used(h + 1, k) E Y h+ x >k then S h+lyk \= Q. 

• 5i t k is either _L or an a-state. This means that Y.^ olds ^ does not contain two atoms 
of the forms holds(l, i, k) and holds(l', i, k), where I and V are contrary literals. 

• No two branches come to the same node (i,k). 

• If used(i, k) E Yi then br(g, i, k', k) Yi for any pair (g, k'), k' ^ k. 

• if {a, i, k) E T q then a must be executable in 8^- 

Accordingly, we have S is an answer set for irh. w (V). 

2. Immediate from the construction of Yi ,t ■ 

□ 

Theorem[3]follows directly from this lemma. 
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Appendix C - A Sample Encoding 

This appendix contains the encoding of the planning problem V\ in Example [2] The first 
subsection describes the input planning problem. The next subsection presents the corre- 
sponding logic program 7r/ t)W ('Pi). The last two subsections are the outputs of smodels 
and cmodelswhen this logic program is run with the parameters h = 2 and w = 3. 

Input Domain 

% A possible plan is 

% check; cases (open-> [ ] ; closed-> [ f lip_lock] ; locked-> [ ] ) 
% fluents 
fluent (open) . 
fluent (closed) . 
fluent (locked) . 

% actions 
action (check) . 
action (push_up) . 
action (push_down) . 
action (flip_lock) . 

% executability conditions 
executable (check, [ ] ) . 
executable (push_up, [closed] ) . 
executable (push_down, [open] ) . 
executable (f lip_lock, [neg (open) ] ) . 

% dynamic laws 

causes (push_down, closed, []) . 
causes (push_up, open, [ ] ) . 
causes (f lip_lock, locked, [closed] ) . 
causes (f lip_lock, closed, [locked] ) . 

% knowledge laws 

determines (check, [open, closed, locked] ) . 
% static laws 

oneof ( [open, closed, locked] ) . 
% initial state 

initially (neg (open) ) . % window is not open 
% goal 

goal (locked) . % window is locked 

Encoding 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
% Usage: 

% lparse -c h=<height> -c w=<width> | smodels 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
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#domain fluent (F) . 
tdomain literal (L; LI ) . 
#domain sense (G; Gl ; G2 ) . 
#domain time(T) . 
#domain timel (Tl) . 
#domain path (P ; PI ; P2 ) . 
tdomain action (A) . 

% Input parameters 
time ( 1 . . h) . 
timel (1 . .h+1) . 
path ( 1 . . w) . 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
% Action declarations 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
action (check) . 
action (push_up) . 
action (push_down) . 
act ion ( f lip_lock ) . 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
% Fluent declarations 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

fluent (open) . 

fluent (closed) . 

fluent (locked) . 

sense (open) . 

sense (closed) . 

sense (locked) . 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
% DOMAIN DEPENDENT RULES 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

% Initial situation 
holds (neg (open) ,1,1) . 

% Executability conditions 
poss ( check, T, P ) . 

poss (push_up, T, P ) :- 

holds (closed, T, P) . 
poss (push_down, T, P ) :- 

holds (open, T, P) . 
poss (f lip_lock, T, P) :- 

holds (neg(open) ,T,P) . 

% Effects of non-sensing actions 
e (closed, T+l, P) :- 

occ (push_down, T, P ) . 
pc (closed, T+l, P) :- 

occ (push_down, T, P ) . 
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e (open, T+l, P) :- 

occ (push_up, T, P ) . 
pc (open, T+l, P) :- 

occ (push_up, T, P ) . 
e (locked, T+l, P) :- 

occ (flip_lock, T,P) , 

holds (closed, T, P) . 
pc (locked, T+l, P) :- 

occ (flip_lock, T,P) , 

not holds (neg ( closed) , T, P ) . 
e (closed, T+l, P) :- 

occ (f lip_lock, T, P) , 

holds (locked, T, P) . 
pc (closed, T+l, P) :- 

occ (f lip_lock, T, P) , 

not holds (neg (locked) , T, P ) . 

% Effects of sensing actions 
:- occ ( check, T, P ) , 

not br (open,T,P,P) , 

not br (closed, T, P, P) , 

not br (locked, T,P, P) . 
1 {br (open, T, P, X) :new_br (P, X) } 1 :- 

occ (check, T, P) . 
1 {br (closed, T, P, X) :new_br (P,X) } 1 :- 

occ (check, T,P) . 
1 {br (locked, T, P, X) :new_br (P, X) } 1 :- 

occ (check, T,P) . 
:- occ ( check, T, P ) , 

holds (open, T, P) . 
: - occ (check, T, P) , 

holds (closed, T, P) . 
: - occ (check, T, P ) , 

holds (locked, T, P) . 

% Static laws 

holds (neg (open) , Tl , P ) :- 

holds (closed, T1,P) . 

e (neg (open) , T+l, P) :- 

e (closed, T + l, P) . 

pc (neg (open) , T+l, P) :- 

pc (closed, T+l, P) , 

not holds (neg (open) , T, P ) , 

not e (neg (closed) , T+l, P) . 

holds (neg (open) , Tl, P) :- 

holds (locked, T1,P) . 

e (neg (open) , T+l, P) :- 

e (locked, T + l, P) . 
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pc (neg (open) , T+l , P ) :- 

pc (locked, T+l, P) , 

not holds (neg (open) , T, P ) , 

not e (neg (locked) , T+l, P) . 

holds (open, Tl, P) :- 

holds (neg (closed) , Tl, P) , 
holds (neg (locked) , Tl, P) . 

e (open, T+l, P) :- 

e (neg (closed) , T+l, P) , 
e (neg (locked) , T+l, P) . 

pc (open, T+l, P) :- 

pc (neg (closed) , T+l, P) , 
not holds (open, T, P) , 
not e (closed, T+l, P) , 
not e (locked, T + l, P) . 

pc (open, T+l, P) :- 

pc (neg (locked) , T+l, P) , 
not holds (open, T, P) , 
not e (closed, T+l, P) , 
not e (locked, T + l, P) . 

holds (neg (closed) , Tl, P) :- 
holds (open, Tl, P) . 

e (neg (closed) , T+l, P) :- 
e (open, T+l, P) . 

pc (neg (closed) , T+l, P) :- 
pc (open, T+l, P) , 
not holds (neg (closed) , T, P ) , 
not e (neg (open) , T+l, P ) . 

holds (neg (closed) , Tl, P) :- 
holds (locked, T1,P) . 

e (neg (closed) , T+l, P) :- 
e (locked, T+l, P) . 

pc (neg (closed) , T+l, P) :- 
pc (locked, T+l, P) , 
not holds (neg ( closed) , T, P ) , 
not e (neg (locked) , T+l, P) . 

holds (closed, Tl, P) :- 

holds (neg (open) , Tl, P) , 
holds (neg (locked) , Tl, P) . 

e (closed, T+l, P) :- 

e (neg (open) , T+l, P) , 
e (neg (locked) , T+l, P) . 
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pc (closed, T+1,P) :- 

pc (neg (open) , T+l, P) , 
not holds (closed, T, P) , 
not e (open, T+l , P ) , 
not e (locked, T+l, P) . 

pc (closed, T+l, P) :- 

pc (neg (locked) , T+l, P) , 
not holds (closed, T, P) , 
not e (open, T+l , P ) , 
not e (locked, T+l, P) . 

holds (neg (locked) , Tl, P) :- 
holds (open, Tl, P) . 

e (neg (locked) , T+l, P) :- 
e (open, T+l, P) . 

pc (neg (locked) , T+l, P) :- 
pc (open, T+l, P) , 
not holds (neg ( locked) , T, P ) , 
not e (neg (open) , T+l , P ) . 

holds (neg (locked) , Tl, P) :- 
holds (closed, T1,P) . 

e (neg(locked) ,T+1,P) :- 
e (closed, T + l, P) . 

pc (neg (locked) , T+l, P) :- 
pc (closed, T+l, P) , 
not holds (neg ( locked) , T, P ) , 
not e (neg (closed) , T+l, P) . 

holds (locked, Tl, P) :- 

holds (neg (open) , Tl, P) , 
holds (neg (closed) , Tl, P) . 

e (locked, T+l, P) :- 

e (neg (open) , T+l, P) , 
e (neg (closed) , T+l, P) . 

pc (locked, T+l, P) :- 

pc (neg (open) , T+l , P ) , 
not holds (locked, T, P) , 
not e (open, T+l, P) , 
not e (closed, T+l, P) . 

pc (locked, T+l, P) :- 

pc (neg (closed) , T+l, P) , 
not holds (locked, T, P) , 
not e (open, T+l , P ) , 
not e (closed, T+l, P) . 
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%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

% GOAL REPRESENTATION 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

goal(Tl,P) :- 

holds (locked, T1,P) . 

goal(Tl,P) :- 

contrary (L, LI ) , 
holds (L, Tl, P) , 
holds (LI, T1,P) . 

:- used(h+l,P) , 

not goal (h+1, P) . 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
% DOMAIN INDEPENDENT RULES 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
% Rules encoding the effects of non-sensing actions 
holds (L, T+l, P) :- 

e (L,T+1,P) . 

holds (L, T+l, P) :- 

holds (L, T, P) , 
contrary (L, LI ) , 
not pc (LI, T+1,P) . 

% Inertial rules for sensing actions 
% Cannot branch to the same path 
:- PI < P2, 

P2 < P, 

br (G1,T,P1,P) , 
br (G2, T, P2, P) . 

Gl != G2, 

PI <= P, 

br (G1,T,P1,P) , 

br (G2, T, PI, P) . 

:- PI < P, 

br (G, T, PI, P) , 
used (T, P) . 

used(T+l,P) :- 
PI < P, 
br (G,T,P1,P) . 

holds (G, T+l, P) :- 
PI <= P, 
br (G, T, PI, P) . 

holds (L, T+l, P) :- 
PI < P, 



ASP with Sensing Actions, Incomplete Information, and Static Causal Laws 69 



br(G,T,Pl,P) , 
holds (L, T, PI) . 

% Rules for generating action occurrences 
l{occ(X,T,P) :action(X) }1 :- 

used (T, P) , 

not goal (T, P) . 

: - occ (A, T, P) , 

not poss (A, T, P) . 

% Auxiliary Rules 
literal (F) . 
literal (neg (F) ) . 

contrary (F, neg (F) ) . 
contrary(neg(F),F) . 

new_br(P,Pl) :- 
P <= PI . 

used (1,1) . 
used(T+l,P) :- 

used (T, P) . 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
% HIDE/SHOW ATOMS 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
hide . 

show occ (A, T, P) . 
show br (G, T, P, PI) . 



Smodels Output 

$ lparse -c h=2 -c w=3 examples/ex2 . smo | smodels 

smodels version 2.28. Reading ... done 
Answer: 1 
Stable Model: 

br (open, 1 , 1 , 2 ) occ ( check, 1 , 1 ) br (closed, 1 , 1 , 1 ) 

br(locked,l,l,3) occ ( f lip_lock, 2,1) 

True 

Duration: 0.020 

Number of choice points: 2 

Number of wrong choices: 

Number of atoms: 313 

Number of rules: 8 93 

Number of picked atoms: 257 

Number of forced atoms: 31 

Number of truth assignments: 4052 

Size of searchspace (removed) : 12 (65) 
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Cmodels Output 

$ lparse -c h=2 -c w=3 examples/ex2 . smo | cmodels 

cmodels 

cmodels version 3.01 Reading ... done 
Program is not tight. 
Calling SAT solver mChaff... 
Answer: 1 

Answer set: br (open, 1 , 1 , 3 ) occ ( check, 1 , 1 ) br ( closed, 1 , 1 , 1 ) 
br(locked,l,l,2) occ( f lip_lock, 2,1) 
Number of Loop Formulas 6 
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